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ABSTRACT 


The  problem  is  that  of  optimally  testing  a  coherent  system 
to  learn  some  characteristic  of  it,  for  example,  whether  it 
is  operating  or  not.  A  branch  and  bound  and  a  dynamic 
programming  solution  are  given,  as  well  as  a  comparison  of 
computer  computation  times  for  both.  Several  specific 
models  with  analytical  solutions  are  also  presented. 

The  general  problem  is  posed  abstractly  in  Chapter  1,  and 
two  solution  methods  are  detailed.  Briefly,  we  are 
presented  with  n  binary-valued  random  variables  and  a 
function  of  the  vector  of  these  random  variables.  The 
object  is  to  learn  the  value  of  the  function  by  testing 
some  of  the  random  variables  and  an  optimal  testing  policy 
uses  the  minimum  average  time  to  complete  its  testing.  The 
first  solution  method  given  is  a  dynamic  programming  type  as 
this  is  easier  to  formulate.  The  second  is  a  branch  and 
bound  method;  its  description  is  somewhat  more  involved. 
Chapter  2  is  concerned  with  formulating  general  fault 
finding  models  which  can  be  cast  as  problems  solved  by  the 
methods  presented  in  Chapter  1.  Also  included  is  some 
computational  experience  comparing  the  effectiveness  of  both 
solution  methods  for  one  of  the  models  described.  Chapter  3 
gives  several  specific  fault  finding  problems  and  analytical 
results  for  them.  This  chapter  is  developed  independently 
of  Chapter  1,  and  relies  on  Chapter  2  only  for  its  initial 
discussion  of  coherent  systems. 
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CHAPTER  1 


DECISION  PROBLEMS 

Introduction 

Decision  problems  occur  when  one  is  presented  with  the  task  of  de¬ 
termining  the  value  of  a  known  function  of  n  random  variables,  in  the 
following  circumstance.  We  suppose  that  the  n  random  variables  are 
binary,  each  one  taking  the  value  zero  or  one.  There  is  a  time  t.  £  0 
incurred  to  determine  the  value  of  the  ith  random  variable.  We  remark 
that,  of  course,  t.  could  be  interpreted  as  any  other  measure  of  the 
cost  to  determine  the  value  of  the  ith  random  variable.  A  procedure,  or 
policy,  for  determining  the  value  of  the  known  function  is  a  rule  which 
tells  its  user  which  random  variable  to  test  next,  based  on  the  results 
of  the  previously  made  tests,  or  to  Stop,  if  the  known  function  can  have 
only  one  value,  almost  surely,  given  the  results  of  tests  made  up  to 
this  point. 

When  the  function  is  one  to  one,  determining  its  value  is  equivalent 
to  determining  the  value  of  every  random  variable.  In  general,  however, 
the  function  is  many  to  one.  The  collection  of  inverse  images  of  each 
functional  value  forms  a  partition  of  the  sample  space;  determining  the 
function's  value  is  then  equivalent  to  identifying  the  set  of  the  parti¬ 
tion  in  which  the  vector  of  random  variables  resides. 

There  are  various  criteria  for  comparing  policies;  we  will  deal  pri¬ 
marily  with  the  expected  time  until  the  testing  process  stops  and  the 
value  of  the  known  function  has  been  determined  almost  surely.  A  straight¬ 
forward  dynamic  programming  solution  and  a  branch  and  bound  type  solution 
are  presented,  as  well  as  an  example  illustrating  the  branch  and  bound 
algorithm.  The  chapter  is  concluded  with  a  brief  discussion  of  another 


criterion. 
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The  generality  of  this  model  of  decision  problems  suggests  that  a 
wide  variety  of  real  problems  can  be  formulated  in  these  terms.  One 
such  problem  is  that  of  converting  "decision  tables",  a  format  for  computer 
programming,  into  a  set  of  machine  instructions  logically  equivalent  to 
the  given  decision  table.  See,  for  example,  [Reinwald  and  Soland,  1966] 
and  [Reinwald  and  Soland,  1967].  Some  fault  finding  problems  can  also  be 
formulated  this  way,  which  is  the  topic  of  Chapter  2.  A  frequent  sugges¬ 
tion  as  a  real  application  is  clinical  diagnosis. 


A.  Definitions  and  Notation 


We  say  (Yj . Yn  ;  A | . A^)  is  a  n-th  order  decision  problem 

when: 

(i)  Y, . Y  are  binary  (0  or  1)  random  variables 

i  n 

and 

(i  i)  [Aj  ,  ....  Arf]  is  a  partition  of  £0 ,  l]n  ■  £  (uu^ . tu^)  |  uu. 

■  0  or  1  ,  i  ■  1  ,  . . . ,  n)  . 

The  correct  action  is  k  ,  a  random  variable,  defined  by  (Yj . Y^) 

€  A^  almost  surely.  We  put  Y  *  (Yj . Yr)  and  write  Y  €  a.s. 

The  object  is  to  determine  the  correct  action  almost  surely  in  minimum 
expected  time  by  successively  determining  the  values  of  the  Yj's.  The 
"correct  action"  should  be  thought  of  as  the  value  of  the  function  men¬ 
tioned  in  the  introduction  and  should  not  be  confused  with  a  policy  for 
determining  it.  This  terminology  is  motivated  by  applications,  such  as 
in  Chapter  2. 


To  determine,  or  test,  Yj  requires  time  tj  2  0  .  The  process 
stops  when  we  are  almost  sure  of  the  value  of  random  variable  k  .  We 
say  the  state  of  the  problem  is  s  3  (Sj  ,  . . . ,  s  )  €  {-1  ,  0 ,  1  ]n  when , 


if  s.  3  -1  then  Y.  has  not  been  tested,  while  if  s.  3  0  or  I  then 
J  J  J 
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Yj  has  been  tested  and  its  value  is  Sj  .  In  particular,  set  s  = 

(-1 . -I)  ,  the  initial  state  when  no  Yj  has  been  tested.  We  de¬ 

fine  s  «  {Y j  =  Sj  |  j  3  Sj  ¥  -1}  ,  the  event  In  the  underlying  probability 
space  corresponding  to  the  state  of  knowledge  s  . 

A  policy  for  (Yj  ,  . . .  ,  Yn  ;  Aj  ,  ....  A^)  is  a  rule  which  tells  its 
user  which  Yj  to  test  next,  based  on  the  state  of  the  problem.  Speci¬ 
fically,  to  use  policy  tt  when  in  state  s  means  to  test  Y^^j  ,  if 

n(s)  =  I,  2 . n  ,  and  to  stop  if  tt(s)  =  0  ,  indicating  the  correct 

action  is  almost  surely  determined.  We  find  it  formally  convenient  to 
require  that  policies  stop  at  states  s  for  which  P(s)  *  0  ,  and  at 
those  states  when  the  correct  action  is  known,  almost  surely.  Accordingly, 
3.  Pol  icy  will  be  any  function  rt  on  {-1  ,0,1}°  to  [0,  I  ,  . . .  ,  n] 

such  that  for  states  s  *  (s, ,  ...,  s  )  . 

i  n 

(1)  P(s)  =  0  ->  tt(s)  -  0 


(2)  P(s)  >  0  -> 


(a)  tt(s)  -  0  <=>  3  k  3  P[Y  6  A  |  s}  =  1  and 


^(b)  tt(s)  -  j  >  0  »>  Sj  ■  -1  . 

We  will  denote  by  T^s)  the  (conditional)  expected  time  to  deter¬ 
mine  the  correct  action,  beginning  in  state  s  and  following  policy  rr  . 
Specif i cal  ly , 


»  expected  time  until  process  stops,  beginning 
In  state  s  and  using  policy  n  ,  with  respect 
to  the  conditional  distribution  P{-  |  s}  , 

-  0  if  P(s)  -  0  . 


There  are,  of  course,  various  criteria  for  comparing  policies.  Our 
interest  here  is  in  the  expected  time  to  determine  the  correct  action, 


beginning  in  the  initial  state  s  .  Accordingly,  we  will  call  policy 
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ttq  optimal  if  it  minimizes 


Tn‘s°> 


among  a  1 1  pol icies. 


fnamlc  Programming  Solution 


Let  us  enrich  the  problem  by  assuming  the  time  to  test  Yj  depends 
on  the  state  s  of  the  problem  when  the  test  is  performed,  say  t(j,  s)  . 
This  being  the  case,  we  have  for  all  states  s  3  P(s)  >  0  ,  for  all 
policies  tt  3  n(s)  -  j  >  0  ,  Tn(s)  -  t(j,  s)  +  P{Y^  =  0  |  sjT^s')  + 

P CYj  =  1  |  s}Tn(s")  ,  where  s‘  ■  (Sj . Oj ,  -  - . ,  sr)  and  s"  = 

( S| . 1. . s^)  .  This  provides  the  basis  for  the  usual  dynamic 

programming  solution  to  finding  an  optimal  policy.  Let  the  optimal  value 
function  be  T(s)  *  Min  (Tn(s)  |  policies  tt)  .  Then 


0  if  3  pol  icy  tt  3  tt(s)  ■  0  ,  i.e.  ,  which  stops  at  s  ; 
otherwi se 

Min  (t(j,  s)  +  P{Y.  -  0  |  s}T(s ')  +  P[Y  -  I  |  s)T(s")) 
j3Sj— 1  J  J 


where  s'  *  (s,  ,  . . .  ,  0 . s  )  and  s"  *  (s. . I . 

I  j  n  l  j 


This  functional  equation  can  be  solved  by  considering  states  s  with 
successively  more  occurrences  of  -I.  An  optimal  policy  is  also  generated 
in  the  usual  way. 

The  primary  restriction  to  using  this  solution  is  that  the  function¬ 
al  equation  must  be  used  once  for  each  state,  indicating  that  the  compu¬ 
tational  complexity  of  the  algorithm  grows  roughly  as  1° .  On  the  other 
hand,  by  making  these  computations  we  have  really  solved  a  family  of 
related  problems,  each  one  an  extension  of  the  original.  This  bonus  is 
a  common  occurrence  in  the  solution  of  problems  via  dynamic  programming. 


The  extension  is  simply  the  assumption  that  the  cost  of  determining 


Y. 

J 


can  be  allowed  to  depend  on  the  tests  previously  made  and  their  results, 


namely  the  current  state  of  the  problem.  Also,  we  see  that  an  optimal 
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policy  ttq  recovered  from  the  functional  equation  above  really  satisfies: 
Vs  :  T  (s)  *  Min  T  (s)  .  That  is,  n  is  optinal  uniformly  for  be- 

TT  TT  O 

o  rr 

ginning  in  any  state  s  .  This  is  stronger  than  simply  requiring  it 
minimize  T^ (s°)  .  If,  during  the  use  of  this  optimal  policy,  some  more 
information  were  to  become  available,  then  the  problem  need  not  be  re¬ 
solved  for  a  new  initial  state;  one  can  continue  to  use  the  optimal  policy 
generated  by  the  functional  equation,  as  it  will  be  optimal  for  this 
newly  created  problem. 

C.  Branch  and  Bound  Solution 

The  decision  problem  has  a  "branch  and  baind"  solution  which,  like 
any  solution  of  this  type,  is  potentially  an  enumeration  of  all  feasible 
solutions.  Such  complexity  is  usually  not  realized  and  this  solution 
provides  a  reasonable  alternative  to  the  dynamic  programming  solution. 

In  order  to  describe  the  branch  and  bound  scheme  precisely,  the 
notion  of  a  policy  tree  will  be  defined.  Roughly,  a  policy  tree  is  a 
partial  specification  of  a  testing  policy.  For  example,  the  policy  tree 
((s°,  ')}  is  the  partial  specification  tt(s°)  *  i  ,  so  we  say  this  policy 
tree  represents  all  policies  which  begin  by  testing  Y.  first  (in  state 
s°).  Thus,  policy  trees  become  the  device  for  "branching"  at  each  step. 
The  "bounding"  of  policy  trees,  that  is,  a  lower  bound  on  Tn(s°)  for 
all  n  represented  by  some  policy  tree,  is  obtained  from  a  lower  bound 
on  the  probability  that  each  Yj  will  be  tested.  An  appealing  feature 
of  the  bound  obtained  occurs  when  the  algorithm  branches  on  a  particular 
policy  tree  by  making  its  specification  more  complete  in  various  ways 
and  hence  creating  a  collection  of  related  trees;  the  lower  bounds  for 
the  newly  created  trees  are  computed  by  simply  adding  a  quantity  to  the 
lower  bound  for  the  former  tree  being  expanded.  These  incremental 


Ti  .  This  upper  bound  is  used  to  eliminate  some  policy  trees  from  con¬ 
sideration  directly.  While  shuch  trees  would  never  be  expanded  in  the 
normal  course  of  computation  anyway,  their  very  presence  consumes  storage 
and  time  in  needless  manipulation. 

The  basic  form  of  our  algorithm  is  the  content  of  [Soland  1966],  except 
for  the  probabilistic  interpretation  of  the  lower  bounds  and  the  compu¬ 
tation  and  use  of  upper  bounds. 

A  policy  tree  will  be  a  partial  specification  for  policies,  and  will 
be  the  device  which  allows  us  to  use  a  branch  and  bound  scheme.  Because 
of  the  inductive  way  this  device  will  be  used,  we  will  define  a  policy 
tree  inductively.  A  pol  i cy  tree  is  a  set  4  of  ordered  pairs  (s,  i) 

of  states  s  and  integers  i  ,  0  s  I  S  n  ,  such  that: 

Bas i s:  4  *  £ ( s° ,  i)}  is  a  policy  tree,  provided  there  exists  a  policy 
tt  such  that  tt(s°)  *  i  . 

Induct  i  on:  If  4.  is  a  policy  tree  and  (s ,  i)  €  4  such  that  i  >  0  and, 

for  s'  *  ( s j . 0. . sn)  and  s"  =  (sj  ,  . . .  ,  1 . . s 

neither  (s',  j)  nor  (s",  k)  are  elements  of  4  for  any 
j  and  k  ,  then  4  U  [(s',  h')  ,  (s",  h")}  is  a  policy  tree 
for  all  pairs  h'  ,  h"  such  that  there  exists  a  policy  tt 
with  tt(s')  *  h'  and  n(s")  *  h"  . 

We  need  to  define  several  quantities  related  to  policy  trees.  First, 
the  policy  tree  4  will  be  said  to  represent  the  policy  tt  when,  for 
all  (s,  i)  €  4  ,  we  have  tt(s)  *  I  .  In  this  way,  policy  trees  partially 

specify  a  policy.  Notice  that  the  larger  the  policy  tree  is,  the  fewer 

policies  it  represents.  When  a  policy  tree  is  as  large  as  possible,  that 
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is,  when  it  cannot  be  extended  according  to  the  induction  step  of  the 
definition  above,  we  say  it  is  a  complete  go  I i cy  tree.  Equivalently,  a 
policy  tree  is  complete  if  it  is  set  maximal  among  all  policy  trees.  We 
will  see  below  that  complete  policy  trees  effectively  represent  only  one 
policy.  One  final  notion  is  that  of  a  terminal  element  of  a  policy  tree. 

The  element  (s ,  i )  €  ^  i  s  a  termina 1  element  of  4  if  i  =  0  or ,  i  f 
i  >  0  ,  then  neither  (s',  j)  nor  (s",  k)  are  elements  of  <4  for  any 

j  and  k  ,  where  s'  “  (s^  ,  . . .  ,  0. . sn)  and  s"  »  (s^  ,  ....  1  .  ,  ....  sn 

Elements  of  4  which  are  not  terminal  are  cal  led  interior.  We  see  the 
terminal  elements  (s  ,  i)  of  a  policy  tree  for  which  i  >  0  are  exactly 
those  referred  to  in  the  induction  step  of  the  policy  tree  definition. 

We  continue  with  some  basic  properties  of  policy  trees. 

Lemma  I :  Every  policy  tree  represents  at  least  one  policy. 

Lemma  2:  A  policy  tree  is  complete  <*>  for  all  terminal  elements  (s,  i) 
of  ^  ,  we  have  i  *  0  *>  all  policies  represented  by  4  are  equivalent 
to  the  user,  i.e.,  implementation  of  any  two  policies  represented  by  4. 
a.s.  lead  to  the  same  sequence  of  tests. 

Lemma  3:  Let  ^  be  any  policy  tree  and  n  be  any  policy  represented  by 
A  .  Then 

(1)  T^s0)  «  5>(S)  t.  +  £p(S)  T„(s)  . 

(s,  i)  interior  (s,  i)  terminal 

elements  of  4  elements  of  4 

Proofs:  Lemmas  I  and  2  are  easily  shown  by  induction.  Lemma  3,  also 
shown  by  induction,  is  proven  here.  We  will  show  that  Equation  (1)  holds 
for  all  policies  represented  by  4  by  showing  inductively  that  the  class 
of  policy  trees  for  which  (1)  holds  contains  all  policy  trees. 
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Basi  s:  If  ^  *  ((s°,  03  .  then  (0  clearly  holds;  (s°,  i)  is  a 
terminal  element. 

Induction:  Let  ^  be  any  policy  tree  for  which  (l)  holds.  Let  (s,  i) 
be  a  terminal  element  of  ^  with  i  >  0  .  Let  s'  = 

*si . °r  sn*  and  s"  =  ^Sr  ’i . Sr ?  • 

If  (h1,  h")  are  such  that  for  some  policy  tt  ,  tt(s')  =  h* 
and  n(s")  *  h"  ,  then  3  *4.11  {(s',  h ' )  ,  (s",  h")}  is  a 
policy  tree  also.  We  show  (I)  holds  for  3  .  If  tt  is 
represented  by  3  ,  then  n  is  represented  by  d  ,  tt(s')  =  h' 
and  tt(s")  *  h"  .  Hence 

P(s)Tn(s)  -  P(s)  ft.  +  P{Y.  -  0  |  s}Tn(s')  +  p{y .  =  I  |  sjT^s") 

-  P(s) tj  +  P(s')Tn(s')  +  P(3")Tn(s")  . 

It  now  follows  (1)  holds  for  B  .  (s',  h 1 )  and  (s",  h")  are  terminal 

elements  of  3  ,  while  (s,  i)  becomes  an  interior  element  of  3  .  II 
A  lower  bound  on  T  (s°)  over  policies  represented  by  a  specific 
policy  tree  is  generated  by  Lemma  3  and  the  following  probabilistic  con¬ 
siderations.  Let  TTj  be  any  policy  satisfying:  Vs  ,  rr^  (s)  *  j  => 

Vi  j  ,  Sj  »*  -I  .  Thus  m  tests  Yj  only  after  all  other  Y.  have 
been  tested.  Many  policies  have  this  property,  but  the  random  event  of 

testing  Yj  is  the  same  for  each  such  policy.  Letting  n  be  any  policy, 

o 

we  have,  for  the  problem  beginning  in  the  nominal  initial  state  s  , 

(2a)  (Yj  is  tested  by  using  policy  TTj}  c  (Yj  is  tested  by  using  policy  tt}  . 
It  follows  that,  for  any  state  s  3  P(s)  >  0  . 


( 
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Pfbeginnlng  in  state  s,  Y.  is  tested  by  using  policy  n.  |  s}  S 

(2t>)  r  J  J, 

Plbeginning  in  state  s,  Yj  is  tested  by  using  policy  rt  [  s]  . 


Lemma  4:  Let  n.  be  as  described  above.  Let 

t .P{beginning  in  state  s,  Yj  is  tested  by  using  policy  ru  |  s] 
Ij(s)  =  |  if  P(s)  >  0 

^0  otherwise, 


and  let  t  *  0  .  Then,  for  any  state  s  and  policy  tree  ^  , 


(i)  for  all  policies  it  3  n(s)  *  i  ,  £  I  -  (s)  +  t.  £  T  (s) 

j-|  J  1  " 

jVi 

(ii)  for  all  policies  tt  represented  by  ^  , 


(3)  T  (s°)  *  E  P(S)t,  ♦  E  P(S)  E  !,(*)  • 

(s.ije^  (s,i)  terminal  j  =  l  J 

element s  of  ^  j?<i 

Denoting  this  lower  bound  by  L(4)  ,  we  have 

(iii)  for  all  policies  tt  represented  by  &  ,  T  (s°)  s  L(tf)  and, 


n 


when  &  is  complete,  T  (s°)  =  L^£) 


Proof:  (ii)  and  (iii)  follow  from  (i)  and  Lemma  3.  (0  follows  from 

Equation  (2)  above  and  the  formula: 
n 

£t.P{Y.  is  tested  when  beginning  in  state  s  and  using  n  |  s} 
i-1  J  J 

J  if  P(s)  >  0 

T„<5> 


o  if  P ( s)  •  0  .11 

An  upper  bound  on  the  value  of  an  optimal  policy  can  be  computed  in 

terms  of  quantities  which  will  be  on  hand  from  making  the  lower  bound 

computations.  Specifically,  let  (2.  be  any  policy  tree  and  n  any 
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policy  such  that  for  all  interior  elements  (s,  i)  of  4  ,  we  have 
rr(s)  =  i  .  Thus  tt  may  not  be  represented  by  4  .  However,  Equation 
(1)  remains  valid  and  is  proven  the  same  way.  Recalling  this  formula. 

T(S°)  -  £  p(s)t.  +  £  P(S)Tn(s)  . 

(s,  i)  interior  (s,  i)  terminal 

elements  of  elements  of  4 

Now,  let  (s,  i)  be  a  terminal  element  of  for  which  i  >  0  .  Suppose 
rr  is  such  that,  beginning  in  state  s  ,  Yj  is  not  tested  unless  it  is 
necessary,  that  is,  unless  tt  has  tested  all  untested  Yj  (those  with 
Sj  *  -1)  except  Y.  ,  and  still  it  is  necessary  to  know  Y.  in  order  to 
determine  the  correct  action.  Thus  ti  is  a  n.  policy  as  described 
above.  Then  Ij(s)  *  t.P{Y.  's  tested»  beginning  in  state  s  and  using 
pol  icy  tt  |  s 3  so 

Tn00  *  £  t  ♦  1.(5)  . 

jJSj-l  J 

j* 

It  follows  that  for  some  policy  tt  , 

(4)  T  (s°)  *  £  P(s)t.  +  £  P(s)  £  t  +  I  (s)  . 

(s,i)  interior  ( s  ,  i )  terminal  j3s.  =  -l  J 

element  of  4-  element  of  4  3  A. 

i  >  0 

Denoting  this  upper  bound  by  U(iZ)  ,  we  have  for  all  policy  trees  4  , 

T  (s°)  s  U(£)  ,  where  tt  denotes  an  optimal  policy,  and  T^  (s°)  =  U 
o  o 

whenever  4.  is  complete.  Our  branch  and  bound  scheme  will  generate 

various  policy  trees  and  the  quantities  necessary  to  compute  U(^Z)  . 

The  following  formulas  give  an  explicit  form  for  Ij(s)  and  also 

allow  us  to  see  how  the  bounds  L (fit)  and  U fat)  are  computed  sequentially 


as  needed.  Def i ne 
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n(s)  -  |(U  =  (u), ,  ...  ,  UJn)  e  Co ,  !]n  I  Vj  3  Sj  =  0  or  1  ,  we  have  ujj  = 

so  in  particular,  Q  *  fi(s°)  =  {0,  1  }n  .  One  can  think  of  oi  €  as  an 
outcome  of  the  random  vector  Y  =  (Yj  ,  ...»  Y^)  .  The  probability  of  such 
an  outcome  is  P{Y  =  u>3  •  Let 


Aj  (w) 


Aj  (u)) 


if,  for  u)'  *  ((Dj . (1  -  UDj) . tun)  ,  we  have 

P(Y  *  u)}  >  0  ,  P(Y  “  u)'}  >  0  and  <w  and  id'  belong  to 
distinct  sets  of  the  partition  {Aj ,  A^)  . 

otherwi se. 

if,  for  u)'  -  (<Dj  ,  . . . ,  (1  -  ujj) ,  . . .  ,  u>n)  ,  we  have 
P[Y  *  uj}  *  0  or  P(Y  ■  u)‘]  =  0  or  id  and  id1  belong  to 
a  common  set  of  the  partition  {Aj  ,  . Arf}  . 
otherwi se. 


These  are  devices  for  computing  Ij(s)  .  For  states  s  with  P(S)  >  0  , 


Ij(s) 


It.  £  P{Y  «  ID  I  s}a.(uu)  if  s.  *  -1 
J«S1  J  J 


if  s .  *  0  or  1 
J 


However,  what  really  appears  in  computing  L(£)  and  is  P(s)I.(s) 

For  any  state  s  and  integer  j  ,  I  s  j  s  n  ,  define 


t.  £  P(Y  ■  u)}A.(id)  if  s.  ■  -1 


E.(s) 


|  A., PlY ' 

I 


if  s.  a  0  or  1  , 
J 


so 


(5) 


P(s)Ms)  =  E .  (s)  . 
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For  any  state  s  and  integer  j  ,  1  £  j  s  n  ,  define 


Ej(s) 


t.  52  P{Y  *  uu}Z.(u>)  if  s.  ■  -1 
iu6n(s)  ■* 


if  s.  *  0  or  I 
J 


Notice?  Aj(u>)  +  Aj  (<u)  =  1  ,  so 

(6)  E.(s)  +  E.(s)  -  tjP(s)  . 

Now,  recalling  the  formulas  (3)  and  (4),  we  can  write  L(£)  and  U(£) 
in  terms  of  Ej  (s)  . 


(7)  l w  -  £  p(s)t.  +  52  52  Ej(s)  . 

(s,  i)€4  (s,  i)  terminal  j«l  J 

element  of  A.  jjH 

(8)  urn  -  E  p(S)t.  +  E  p(s)  T.  me 

(s,  I)  interior  (s,  i)  terminal  j5s.*-l  J 

element  of  element  of  CL  3  A. 

i  >  0  Jf 1 


In  particular,  for  a  policy  tree  of  the  form  CL  *  {(s°,  i)}  with  i  >  0 
using  (6)  we  see 

(9)  IfcO  *  E  •  £  E.(s°)  +  E.(s°)  . 

j-i  1  j»i  J 

(10)  U(tf)  -  E  «i  -  8|(s°)  • 

J-l  J  J 

Now,  if  we  extend  a  policy  tree  ^  ,  we  want  to  obtain  the  bounds  for  the 

new  tree  in  terms  of  the  bounds  on  CL.  Specifically,  let  (s,  i ' )  be  a 

terminal  element  of  the  tree  ^  wi  th  i  >0  .  For  s'  *  (sj  ,  ....  0.,  . 

and  s"  *  (s.  1 s  ),  i f  h'  and  h"  are  such  that  there 

i  i  n 

exists  a  policy  with  tt(s')  ■  h*  and  tt(s")  ■  h"  ,  then  according  to 
the  induction  step  of  the  definition  of  policy  tree,  3-<2.u  ((s',  h')  , 
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(s",  h")}  is  also  a  policy  tree;  using  (7),  (8)  and  (6),  we  have 

(11)  L<3)  -  L(tf)  +  Eh,(s')  +  Eh,((s") 
and 

(12)  u(3)  =  U (O)  +  E.(s)  -  Eh,  (s')  -  Eh,((s") 

-  1  (h»  =  O)P(s')  D  t  -  1(h"  *  0)P(s")  D  t 
jBsj»-l  J  j3s'H  J 

where  we  have  taken  Eq(s)  =  0  for  every  state  s  and 

1  if  h  ■  0 

1 (h  *  0)  =  ,  h  *  h 1  ,  h"  . 

0  if  h  0 

Let  (Yj . Yn  ;  Aj  ,  Arf)  be  a  decision  problem.  The 

following  is  a  "branch  and  bound"  algorithm  for  finding  an  optimal  policy. 

Step  1:  If  every  policy  tt  stops  at  initial  state  s°  (corresponding 
to  the  correct  action  being  a  constant  almost  surely),  then  T^s0)  =  0  . 
If  not,  then  Vi  3  1  £  i  *  n  ,  3  policy  tt  3  rr(s°)  *  i  .  Let 

*9"*  {#.  j  i  =  1,  2 . n}  where  =  {(s°,  i))  ,  a  policy  tree. 

Notice  every  policy  is  represented  by  exactly  one  of  the  policy  trees  in 
the  family  «?“.  Set 

L«V  -  t,  *  i  Ej(.°) 

J«l  J 

jVi 

and 

U(0  -  E  (s°)  +  £  t. 

'  1  j-1  J 

J*i 

and 

U  »  Min  U(<?.)  . 
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Then  (s°)  £  U  and,  for  all  1  and  n  with  tt(s°)  =  i  ,  Lfc^. )  s 
° 

Tn(s  )  .  Continue  to  Step  2. 

Step  2:  Reduce  the  set  eJF  by  removing  any  A.  L fa.)  >U  ;  those 
policy  trees  cannot  represent  an  optimal  policy.  Let  the  policy  tree£ 
satisfy  L(£)  *  Hin  L(£.‘ )  over  A  1  €  &  .  If  A  is  a  complete  policy 
tree  (cannot  be  augmented),  then  all  policies  it  represents  are  equiva¬ 
lent  to  the  user  (Lemma  2)  and  any  one  of  them  is  optimal,  since  for  such 
a  policy  n  ,  L(4)  ■  T^s0)  .  If  A  is  not  complete,  continue  to  Step 
3. 

Step  3:  Since  4-  is  not  complete,  there  exists  a  terminal  element 
(s,  i)  of  4-  wi th  i  >  0  .  Let  s 1  -  (Sj  ,  . . .  ,  Oj  ,  . . .  ,  sn)  and 

s"  «  (Sj . lj,  ...,  sn)  .  Augment  <£"  by  removing  A-  and  replacing 

AU  {(s',  h ' )  ,  (s",  h")}  for  each  pair  (h1,  h")  such  that  for  some 
policy  tt  ,  tt(s')  ■  h'  and  tt(s")  -  h"  .  The  possible  values  for  h' 
will  be  either  {o}  (if  s'  Is  a  state  in  which  no  more  testing  is 
necessary)  or  [j  j  sj  *  -l)  (any  Yj  not  already  tested);  possible 
values  for  h"  are  analogous  with  respect  to  s"  .  For  any  such  pair 
(h 1 ,  h")  ,  set 

L(£  U  {(s' ,  h')  .  (s",  h")})  -  LW  +  Eh,(s')  +  Ehll(s") 
and 

UfctU  t.(s\  h«)  ,  (s",  h")})  -UW+  E.(s)  -  Eh,(s')  -  Eh,((s") 

-  1  (h*  -  O)P(s')  Z  t.  -  1  (h"  ■  0)P(s")  L  t.  . 

j=>sj-l  J  jSs'.'-l  J 

Recompute  the  upper  bound  U  by  U  *  Min  U(tf')  over  £  S'  and  return 
to  Step  2. 


D .  An  Examp  1  e 


We  conclude  this  chapter  with  a  numerical  example  of  the  above  branch 

and  bound  method.  The  origin  of  the  example  is  a  fault  finding  problem 

which  will  be  fully  described  in  Chapter  2. 

We  first  remark  that  for  any  decision  problem  (Y. . Y  , 

i  n 

Aj . A^)  ,  it  is  sufficient  to  know  only  the  sets 

A.  -  |(U  €  {0,  l}n  |  a)  6  A.  ,  P[Y  -  id)  >  o}  . 

All  the  quantities  computed  in  the  course  of  the  algorithm  remain  un¬ 
changed  when  A.  is  replaced  by  Aj  ,  and  we  could  have  assumed  ini¬ 
tially  that  the  sets  Aj . A^  only  partition  U  6  (o.  i}n  I  p(y  -  uu}  >  0], 


Data  for  example: 


P{Y  -  u)} 


0  1/74  0  0  7/74  3/74  42/74  21/74 


U)|  I  1 

U)2  1  1 

u >2  1  o 


1  0  1 


0  I 


1  1 


Each  column  of  the  table  represents  an  u)  €  {0,  l]^  .  The  sets 
Aj  ,  ...»  A  ^  are  singletons,  so  a  policy  must  determine  for  each  out¬ 
come  of  the  random  vector  Y  which  elementary  outcome  <u  <z  ft  has 
occured.  Clearly,  no  policy  can  stop  by  making  no  tests.  We  recall 

that  E.(s)  ■  t.  21  A.  (oj)  and 
J  J  «u€n(s)  J 
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Aj  (u>) 


i  f ,  for  id1  ■  (aij . (I  -  u)j )  ,  ...»  <un)  ,  we  have 

p(Y  *  uj}  *  0  or  P{Y  ■  a)'}  ■  0  or  u>  and  u)'  belong  to 
a  common  set  among  the  sets  £a ^  ,  . . .  ,  A^}  . 
otherwi se. 


However,  for  our  example,  we  can  take 

/  1  If  ib.  ■  0  and  P{Y  *  u>}  >  0  and  P(Y  *  (u)j  ,  . . . ,  I j . «)n) )  =  0 

Aj(u>)  -  ! 

'0  otherwise 

without  changing  the  value  of  P  £y  ■  tujZ.  (u>)  .  This  particular  form  for 
Z.  (u>)  always  occurs  for  fault  finding  problems  such  as  this  one.  For 
our  example, 

Ej(s°)  *  4(42/74)  ■  84/37  corresponding  to  u»  *  (0,  0,  l) 

§2(s°)  ■  3(42/74)  -  63/37  corresponding  to  id  ■  (0,  0,  1) 

Ej(s°)  -  16(1/74+  1/lk  +  3/74)  ■  88/37  corresponding  to  u>*  (1,  I,  0), 
id  ■  (1,0,  0)  and  id  ■  (0,  1,0)  respectively. 

We  have  maintained  a  record  of  those  uj  3  p£y  ■  u>)A.  (cd)  >  0  for  each  j  . 
When  computing  Ej (s)  in  steps  to  come,  we  can  use  this  data  rather  than 
having  to  scan  over  the  original  data  table.  Following  Step  1  of  the 
algorithm,  we  will  compute  the  upper  and  lower  bounds  associated  with  the 
trees  d. {  ■  ((s°,i)j  ,  for  i  ■  I,  2,  3  • 

L«.)  -  E  t ■-  t  e,(s°)  +  E,(s°) 

J-l  J  j-l  J 

-  23  -  235/37  ♦  E|(?°)  -  616/37  +  E.(s°)  . 

3  „  .  . 

VIA,)  -El:'  E.(s°)  -  23  -  E. (s°)  . 

j-l  1 
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We  will  represent  policy  trees  graphically,  making  it  clear  how 
they  are  a  partial  specification  of  policies.  For  example,  the  trees 
*  {(s°,  i ) }  and  their  bounds  are: 


L  «  700/37 
U  -  767/37 


L  -  679/37 
U  -  788/37 


L'  *  704/37 
U  -  763/37 


The  current  upper  bound  is  U  ■  Min  Ufc?,)  *  763/37 


1 

Proceeding  to  Step  2  of  the  algorithm,  we  see  none  of  the  three 
trees  generated  so  far  have  lower  bounds  which  exceed  the  current  upper 
bound  U  .  We  see  policy  tree  & ^  ,  representing  all  those  policies 
which  begin  by  testing  ,  has  the  lowest  lower  bound  among  all  policy 

trees  thus  far  generated.  As  this  tree  is  not  complete,  we  begin  by 
extending  this  tree.  A  policy  cannot  stop  in  states  (- 1  ,0,-1)  or 
(-1 ,1 ,-1 )  as  knowledge  of  alone  is  not  sufficient  to  determine  both 

Yj  and  Y^  ,  regardless  of  the  valu-?  Y ^  takes.  Hence,  ((s°,  2)}  has 
four  extensions: 


(s°,2) 


(s°,2) 


(s°,2) 


0/  \1  0/  \  l  0  /  \  1 

(-1, O', -0,1  (-1,1, '-I),l  (-1,0, -0,3  (-1 ,1  '-0  .3  (-1 ,0,-0 ,1  (-1, 1,-1), 3 


(s°,2) 


(-1,0, -0,3  (-1,1, -0,1 


To  compute  the  appropriate  lower  bounds,  we  have 
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Ej(-1,0,-1)  *  4(42/74)  »  84/37  corresponding  to  w  8  (0,  0,  I) 

EjM.O.-!)  =  *6(7/74)  =  56/37  corresponding  to  tu  =  (1  ,  0,  0) 

E,  (-1  ,1,-1)  =  0 

E^-l  .1  ,-l)  ■  16(1/74  +  3/74)  =  32/37  corresponding  to  ui*  (1,  1,0) 
and  (0,  I,  0)  respectively. 

We  see  that  to  compute  say  E^(-l,  1,  -1)  ,  we  need  only  scan  over  uj  6 

n(-l  ,  1,-1)  (i.e.,  111  3  (iij  ■  I)  for  which  P{Y  =  u>}Aj  (tu)  >  0  -  But  th  i  s 

list  of  a)  and  corresponding  P £ Y  =  u)}  has  already  been  generated  in 

computing  E^(s°)  .  The  lower  bounds  for  these  newly  created  trees  are 

easily  computed.  They  are,  respectively,  763/37,  767/37,  795/37,  735/37. 

The  best  (lowest)  of  the  new  upper  bounds  created  is  U(#2)  +  E2(s°)  - 

E  |  ( —  1  ,  0,  -1)  -  E^(-l,  1,  -l)  »  735/37  ,  hence  the  new  value  of  U  is 

735/37.  Returning  to  Step  2,  we  see  the  first  three  of  the  four  newly 

created  trees  can  be  dropped  from  further  consideration,  since  their 

lower  bounds  exceed  the  current  upper  bound  U  on  T  (s°)  .  We  illus- 

o 

trate  the  remaining  policy  trees  while  suppressing  the  states  involved. 
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Now,  the  first  policy  tree  listed  above  has  the  lowest  lower  bound, 
and  since  it  is  not  complete,  we  continue  to  Step  3.  The  computations 
are  analogous  to  those  made  above. 

E2  (0  l )  *  3(42/74)  ~  63/37  corresponding  to  u>  =  (0,  0,  1) 
E.(0,-1,-1)  =*  16(3/74)  *  24/37  corresponding  to  uj  =  (0,  1,  0) 
E2(1.-|.-1)  =  0 

E^(l  ,-!  ,-l)  *  16(1/74  +  7/74'  =  64/37  corresponding  to  10*  (I,  1,  0) 
and  (1,  0,  0)  respectively. 

The  extensions  of  ((s°,  I)}  are: 


L  -  763/37  L  -  788/37  L  -  827/37  L  -  724/37 

The  best  af  the  upper  bounds  generated  is  UG?j)  +  Ej  (s°)  -  ^(O,  -1,  -I) 
-  Ej(l  ,  —I,  -1)  or  724/37  1  bringing  the  current  value  of  U  to 
724/37  .  Accordingly,  the  only  policy  trees  remaining  to  be  considered 
are: 


L  -  724/37  L  -  704/37 

Expanding  the  second  policy  tree,  we  see  if  Y^  a  1  ,  then  Yj  *  ■  0 
a.s.,  so  no  more  tests  can  be  made.  If  Y^  *  0  ,  then  a  policy  can  test 
either  Yj  or  .  Comparing  these  two  extensions  (k  indicates  the 
"correct  action",  so  Y  €  A^  a.s.): 


21 


I 

stop ,  k  *  4 
l 

1 

stop,  stop,  stop,  stop, 

k-5  k  *  3  k  =  2  k  =  1 

L  -  704/37 

This  tree  is  complete.  It  is  a  complete  specification  of  a  policy  for 
the  states  relevant  to  that  policy.  It  follows  that  for  any  n  repre¬ 
sented  by  this  tree  (they  are  all  equivalent),  T^s0)  =  704/37  and  tt 
is  optimal. 

E .  Deadline  Criterion 

Another  criterion  with  great  intuitive  appeal  that  might  be  applied 
to  a  policy  is  the  deadline  criterion.  This  is  simply  the  probability  of 
having  completed  testing,  determining  the  correct  action,  on  or  before 
some  specified  deadline  t  .  Policies  in  this  case  are  rules  telling  the 
user  which  Yj  to  test  next,  or  to  stop,  depending  on  the  current  state 
of  the  problem  and  the  time  remaining  until  the  deadline. 

There  is  a  branch  and  bound  technique  for  solving  this  problem  ana¬ 
logous  to  the  one  given  above.  We  will  describe  a  bound  which  could  be 
used  to  generate  an  algorithm,  however,  it  does  not  seem  particularly 
well  suited  to  the  general  problem.  In  Chapter  3  we  will  discuss  special 
decision  problem  structures  with  the  deadline  criterion. 


. . 
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In  order  to  generate  a  branch  and  bound  procedure  for  this  criterion, 
we  need  a  lower  bound  on  P{'rT7  >  t  |  s]  over  all  policies  tt  ,  for  each 
state  s  and  time  t  for  which  P (s)  >  0  ,  where  Tn(s)  is  the  random 
time  until  the  process  stops,  beginning  in  state  s  and  following  policy 
tt  .  Equivalently,  an  upper  bound  on  P{ttj(s)  £  t  |  s}  over  all  policies 
tt  or  an  upper  bound  on  P[3  policy  tt  9  Tn(s)  5  t  I  s}  would  be  sufficient. 
The  bound  results  by  computing  the  probability  of  this  event  exactly. 

It  would  appear  that  to  compute  this  probability  each  time  the  algorithm 
branches  would  make  a  considerably  long  computation.  In  particular,  an 
incremental  process  by  which  lower  bounds  were  able  to  be  computed  above 
no  longer  works  here. 

We  will  mention  that  the  dynamic  programming  solution  is  easy  to 
formulate,  however,  the  optimal  value  function  now  depends  on  the  state 
of  the  problem  and  the  time  remaining.  This  indicates  that  the  function¬ 
al  equation  must  be  solved  about  3°  x  t  times,  where  t  is  the  initial 
deadline  (assuming  all  times  are  integer). 
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CHAPTER  2 


GENERAL  FAULT  FINDING  MODELS  FOR  COHERENT  SYSTEMS 


Introduct ion 

In  this  chapter  we  introduce  coherent  systems,  a  reliability  model 
for  "go,  no-go'  systems  and  give  a  description  of  some  general  faultfinding 
problems  which  are  solvable  as  decision  problems.  An  example  is  included. 


A.  Coherent  Systems 


Coherent  systems  arise  from  the  study  of  a  physical  system  whose 
operation  is  classified  as  either  functioning  or  failing,  where  this 
operation  is  determined  by  the  joint  functioning  or  failing  of  some  finite 
set  C  of  components.  Accordingly,  a  joint  performance  of  the  components 
C  is  a  function  X  :  C  -•  { 0  ,  1 }  with  the  interpretation  that  for 
all  c?C  , 


X(c) 


0  if  component  c  fails 


(1  if  component  c  functions  . 

A  system  (not  necessarily  coherent)  will  be  denoted  by  (C  ,  cp)  , 

cp  is  a  function  from  all  joint  performances  of  the  components  to 

with  the  interpretation  that  for  all  X  , 

(0  if  the  system  falls  under  joint  performance  X 

<p(X)  -  | 

I  1  if  the  system  functions  under  joint  performance  X  . 


where 
(0  ,  1) 


Coherent  systems  are  those  systems  for  which  the  replacement  of  a  failed 
component  by  a  functioning  one  will  not  cause  a  functioning  system  to  fail. 
To  express  this  precisely,  we  say,  for  joint  performances  X  and  Y  , 

X  s  Y  whenever  for  all  c  €  C  ,  X(c)  £  Y(c)  .  Then  coherent  systems 

are  those  systems  (C  ,  co)  for  which 

(1)  X  s  y  »>  cp(X)  *  <p(Y)  . 
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We  see  some  components  may  have  no  effect  whatsoever  on  the  system's 
performance.  Such  components  are  referred  to  as  inessential  components. 
Components  not  inessential  are  said  to  be  essential.  It  follows  that  a 
component  c  f  C  is  essent  i a  >  to  (C  ,  cp)  if  and  only  if 


(2) 


3  X  9  cp(X  ,  0  )  ^  cp( X  ,  1  )  ,  where 
c  c 

|X(e)  if  e  c 

K  if  e  -  c  . 


(X  ,  Kc)  (e) 


Some  common  examples  of  coherent  systems  are  the  series  and  parallel 

systems.  A  series  system  functions  if  and  only  if  all  its  components 

function  while  a  parallel  system  fails  if  and  only  if  all  its  components 

fail.  These  are  special  cases  of  the  k-out-of-n  systems,  (k/n)  .  A 

(k/n)  system  functions  if  and  only  if  k  or  more  of  its  n  components 

function.  In  our  notation,  we  say  (C  ,  cp)  is  a  (k/n)  system  when 

C  contains  n  components  and  cp( X )  *  I  <*>  £  X(c)  i  k  . 

cGC 

Coherent  systems  have  been  studied  in  the  monograph  [Barlow  and 
Proschan,  1956]  as  well  as  the  literature.  See  [Birnbaum,  Esary  and 

Saunders,  1961]  and  [Esary  and  Proschan,  1963]  for  a  basic  exposition  of 

coherent  systems,  particularly  of  the  properties  of  the  reliability 
function  P{cp(X)  *  1}  ,  the  probability  the  system  functions,  when  the 
performance  of  each  individual  component  is  random  and  independent  of  the 
other  components.  An  application  of  coherent  systems  to  characterize  a 
class  of  life  distributions,  those  with  increasing  hazard  rate  average, 
is  given  in  [Birnbaum,  Esary  and  Marshall,  1966].  Coherent  systems  have 
also  been  studied  under  the  guise  of  blocking  systems.  ([Butterworth, 
1969]  ,  [Fulkerson,  1968]). 

Our  definition  of  coherent  systems  in  this  chapter  will  differ  from 
that  found  in  [Birnbaum,  Esary  and  Saunders,  1961]  and  elsewhere.  A  minor 
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difference  is  that  our  definition  allows  all  components  to  be  inessential. 

A  significant  difference  is  that  while  the  assumption  of  independence,  or 
in  some  instances  the  weaker  assumption  of  association  (e.g.,  see  [Esary, 
Proschan  and  Walkup,  1966]),  is  usually  made,  we  will  allow  the  joint 
performance  distribution  to  be  arbitrary. 

B.  Some  Fault  Finding  Models  for  Coherent  Systems 

Consider  a  coherent  system  (C  ,  cp)  for  which  the  joint  performance 
of  the  components  is  random  according  to  a  known  joint  distribution.  Let 
the  components  be  indexed  c^  , . . . ,  cn  and  suppose  one  can  test  component 
c.  ,  requiring  time  t.  ,  to  learn  whether  it  is  functioning  or  not.  The 
object  is  to  determine  the  state  (function  or  failing)  of  each  component 
a.s.  by  sequentially  testing  some  components,  in  minimum  expected  time. 

A  policy  for  testing  is  a  feedback  rule  telling  the  user  which  component  to 
test  next  or,  if  enough  information  has  been  gleaned,  to  stop,  based  on 
the  previous  tests  and  their  results. 

This  is  clearly  a  decision  problem.  To  make  the  formal  identification, 
let 

!0  if  component  c.  is  failed 

' 

1  if  component  c.  is  functioning  . 

The  sets  Aj  , . . . ,  A^  can  be  taken  as  the  singleton  subsets  of  fO  ,  l}n  , 

clearly  a  partition.  Knowing  the  state  of  each  component  is  then  equivalent 

to  knowing  for  which  k  we  have  Y  6  A  a.s.  We  remark  that  for  any 

k 

decision  problem,  it  is  sufficient  to  know  only  the  sets 

A.  n{u>  i  P(Y  =  u,}  >  0  }  , 

and  we  could  have  assumed  initially  that  the  sets  Aj  ,...,  partition 
|uj  |  P{Y  =  (u)  >  0  }  . 
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We  hasten  to  remark  that  the  introduction  of  coherent  systems  in  the 
above  model  serves  only  as  a  framework.  The  statement  of  the  problem 
does  not  depend  on  the  structure  function  c p  of  the  coherent  system 
(C  ,  cp)  .  Indeed,  the  problem  is  completely  specified  by  the  distribution 
of  the  joint  performance  X  and  the  testing  times  t.  . 

The  reason  for  this  is  the  nature  of  our  objective  in  testing  the 
system.  The  object  was  to  learn  as  much  as  possible  about  the  system, 
namely  the  joint  state  of  the  components.  This  resulted  in  the  partition 
Aj  Aj  of  {0  ,  1  }n  being  the  singleton  subsets.  We  can  formulate 

objectives  for  testing  a  set  of  components  which  require  learning  some¬ 
thing  less  than  everything  from  our  testing,  and  which,  as  is  to  be 
evicted,  lead  to  a  coarser  partition  of  [0  ,  l}n  than  the  one  above. 

Two  such  examples  follow,  in  which  it  happens  that  the  structure  function 
9  does  play  a  role  in  specifying  the  problem. 

The  fi  rst  such  model  assumes,  as  above,  that  the  joint  performance  X 
of  a  set  of  components  C  is  random  according  to  some  known  but  arbitrary 
distribution.  Suppose  then  that  (C  ,9)  is  a  coherent  system  and  that 
we  wish  to  determine,  in  minimum  expected  time,  whether  the  system  is 
functioning  or  not  when  its  components  are  performing  according  to  X  . 

As  usual,  we  can  test  component  c.  in  time  t.  to  learn  its  state. 

This  is  a  decision  problem  with  Yj  ■  X(c.)  ,  i  =  1  n  ,  and  with  the 

partition  Aj  A^  having  just  two  sets, 

A,  -  |u)  |  for  x(Cj)  -  (JOj  ,  i  =  l  ,...,  n  ;  9U)  =■  oj 
and  its  complement 

A2  «  Jiu  |  for  x(cj)  *  u).  ,  i  =  1  ,...,  n  ;  9(x)  =  oj. 

The  framework  of  coherent  systems  provides  a  natural  example  of  a  problem 
in  which  our  objective  requires  learning  something  less  than  everything 
about  the  system. 
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Another,  somewhat  artificial,  example  is  what  we  will  call  the 
repair  problem.  As  above,  weare  given  a  coherent  system  (C  ,  co)  for 


which  the  components  are  performing  according  to  the  random  joint 
performance  X  .  In  this  case,  a  failed  component  can  be  repaired  at  a 
cost  of  r.  £  0  for  component  c.  .  The  object  is  to  learn  what  is  the 
least  (repair)  costly  set  of  failed  components,  say  R  ,  which,  when 
repaired,  will  insure  the  system's  operation  a.s.  If  one  knew  what 
joint  perf  omance  x  :  C  -*  {0  ,  l}n  had  occurred,  it  is  clear  that  R 
would  satisfy 


c.e* 

i 


r.  |  T  are  failed  components  which,  when 


E  ^  *  Hin  \  £ 

R  Jc.a 

repaired,  insure  the  system's  operation,  i.e.,  for 


(3) 


y(tj)  *  \  if  x(c.)  *1  or  if  Cj€T,y(c.)«0  otherwise  , 
we  have  rp(y)  “  1  i  . 


For  example,  th.s  set  R  would  be  empty  if  no  repairs  were  necessary, 
i.e.  co(x)  «  I  .  For  purposes  of  making  this  example  well  formed,  assume 
that  for  every  joint  performance  x  with  positive  probability 
(PfX  *  x]  >  0)  ,  there  is  a  unique  set  of  R  of  failed  components  satisfying 
equation  (3).  This  set  R  might  be  the  same  for  several  outcomes  x  ,  and 
consequently  it  might  not  be  necessary  to  determine  specifically  the  state 
of  every  component.  In  any  case,  we  can  ask  for  a  testing  policy  which, 
in  minimum  expected  time,  determines  just  what  the  set  of  failed  components 
R  is.  The  resulting  decision  problem  has  Y.  ■  X(c.)  ,  i  *  1  ,...,  n  , 
as  usual.  The  partition  Aj  A^  is  induced  by  the  repair  costs  r. 

and  the  structure  function  ©  of  the  coherent  system.  Two  elements  uj 
and  u>'  of  fO  ,  1}°  with  positive  probability  will  belong  to  a  common 
set  A.  whenever,  for  x(c.)  *  uu.  and  x'(c.)  =  uj'  ,  i  *  I  ,...,  n  , 
x  and  x*  are  joint  performances  for  which  the  sane  set  of  components 
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R  satisfy  equation  (3).  Our  assumption  that  R  is  unique  for  each  x 
insures  that  this  definition  makes  A  a  partition.  If  R 

is  not  uniquely  determined  by  (3)  for  each  joint  performance  x  ,  the 
above  problem  formulation  breaks  down.  Clearly  one  can,  for  each  x  , 
designate  a  specific  R  satisfying  (3)  and  use  it.  However,  this 
procedure  can  lead  to  a  sub-optimal  policy.  Notice  that  we  are  content 
to  restrict  our  attention  to  those  u>  for  which  P(Y  *  uj]  >  0  . 

We  should  explain  that  the  following  similar  problem  is  different 
from  the  model  described  above.  Given  a  testing  time  tj  and  a  repair 
time  Tj  for  each  component  Cj  ,  perform  the  necessary  tests  and  repairs 
to  the  components  in  order  to  guarantee  the  system  will  operate  a.s.,  all 
in  minimum  expected  total  time.  Our  solution  of  the  repair  problem  would 
give  a  sub-optimal  policy  for  this  problem,  since  it  would  test  enough 
components  to  determine  the  minimizing  set  if  equation  (3).  However,  it 
may  really  be  optimal  to  stop  testing  prior  to  this.  For  example,  if  the 
sum  of  all  the  repair  times  for  all  components  is  less  than  every  test 
time,  any  optimal  policy  would  make  no  tests  whatsoever  and  instead  would 
make  enough  repairs  so  as  to  Insure  the  system's  operation  a.s.  on  the 
basis  of  no  tests,  repairing  all  components  if  necessary.  It  appears  that 
this  problem  can  not  be  handled  by  the  problem  formulation  of  Chapter  I. 

It  does,  of  course,  have  a  straightforward  dynamic  programming  solution. 

C.  An  Example 

Let  us  illustrate  the  models  presented  so  far  with  an  example.  The 
coherent  system  for  our  example  is  one  witii  three  components,  and  is 
easiest  to  specify  by  the  following  sketch. 
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Figure  1 


Connections  through  a  box  labelled  Cj  are  present  if  and  only  if  component 
Cj  functions.  The  system  functions  if  and  only  if  there  is  a  path  through 
the  network.  Our  system  then  functions  whenever  components  c^  and  , 
either  Cj  or  c^  function.  The  distribution  of  the  joint  performance 
X  is  assumed  to  be  the  following:  we  suppose  that  nominally  the  components 
operate  independently  of  each  other,  with  P(cj  functions}  =  1/4  , 

P{c2  functions}  ■  1/8  and  P{c^  functions}  *  2/3  .  However,  we  suppose 
further  that  the  system  is  down  (not  functioning),  so  the  true  distribution 
of  X  is  the  conditional  distribution  P{X  »  x  |  cp(X)  -  0}  .  For  the 
above  system, 

P{X  -  x}  0  1/74  0  0  7/74  3/74  42/74  21/74  Test  Times 

x(c j )  11  101  0  0  0  t,  =  4 

x(c2)  11  010  1  0  0  t2  -  3 

x(c3)  10  110  0  l  0  t3  -  16 

Table  showing  the  distribution  of  X  and  the  testing  times  for  our 
examp  1 e 

Figure  2 

Each  column  of  the  table  represents  a  joint  performance  x  .  The  top  row 
of  the  table  shows  the  (conditional)  probability  that  X  =  x  for  each  x  . 
The  table  also  shows  the  testing  times  for  each  component. 

The  first  fault  finding  problem  to  consider,  namely  find  the  joint 
state  of  the  components  in  minimum  expected  time,  is  the  exanple  in 


30 


Chapter  I.  The  calcul  ions  are  carefully  explained  there  for  every 
step. 

The  second  model  described  in  this  chapter,  that  is,  determine  with 
probability  one  whether  the  system  will  function  or  not  when  put  in 
service,  is  simplified  by  the  distribution  of  joint  performance.  Since 
the  system  will  a.s.  not  function,  no  tests  are  necessary  and  because 
of  the  requirement  that  a  policy  must  stop  if  possible,  we  see  every 
pol  icy  is  optimal  as  every  policy  must  have  tt(s°)  ■  0  . 

To  specify  the  repair  problem  for  this  coherent  system,  we  need  to 
specify  a  repair  cost  for  each  component.  Figure  3  below  shows  the  set 
R  of  failed  components  realizing  the  minimum  in  equation  3  for  each 
joint  performance  x  with  P(X  -  x)  >  0  .  We  have  taken  r^  =  3  , 
r2  -  2  ,  and  r^  **  l  . 

R-  (c3l  {c33  {c3]  fc2}  [c3  ,  c2} 

x(c,)  1  1  0  0  0  rj  ■  3 

x(c2)  I  0  1  0  0  r2  -  2 

x(c3)  0  0  0  1  0  r3  -  1 


Table  showing  the  set  R  realizing  the  minimum  in 
equation  (3).  Also  shown  are  the  sets  Aj  ,  A2 

and  Aj  which  partition  juu  |  P[Y  *  >  0  | . 

Figure  3 

Notice  the  assumption  that  the  set  R  realizing  the  minimum  of  equation 

(3)  be  unique  is  satisfied  here.  In  Figure  3,  we  have  also  designated 

the  sets  Aj  ,  A2  and  Aj  .  Of  course,  we  identify  each  joint  perfor- 

•> 

mance  x  with  the  u)  f  (0  ,  1}  for  which  uj.  *  x(c.)  ,  i  *  1  ,  2  ,  3  . 
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To  begin  the  algorithm  for  this  example,  we  need  to  compute 


E.(s°)  =  t.  53  =  cju}  A-  (o>)  for  i  =  1  ,  2  ,  3  ,  where 

1  1  u)€Q 

(1  if,  for  co1  =  (<Uj  (1  -  u) i )  .....  ,  we  1 

P{Y  *  cu}  *  0  or  P {Y  =  cuM  =  0  or  and  ou* 


A  j  (co) 


belong  to  the  same  Aj  ,  j  ■  1  ,2,3. 
0  otherwise  . 


We  have: 


E,(s°)  *  4(1/74  +  3/74  +  42/74)  *  92/37 


E2(s°)  =  3(1/74  +  7/74  +  42/74)  =  75/37 


E3(s°)  =16(1/74  +  7/74  +  3/74)  =  88/37 


The  practice  of  keeping  a  record  of  those  cu  for  which  A.(ou)P{Y  =  uu}  >  0 
when  Ej(s°)  is  computed  can  be  followed  here,  as  well  as  for  any  decision 
problem.  This  practice  tends  to  reduce  some  of  the  work  necessary  in 
computing  Ej(s)  for  subsequent  states  s  ,  at  least  in  hand  computations, 
and  is  detailed  in  the  example  of  Chapter  1.  Using  equations  (9)  and  (10) 
of  Chapter  1,  we  begin  with  the  following  three  policy  trees  and  bounds. 


:  °A' 


:=  .ft 


0R| 


L(tf,)  =  688/37  L(<22)  =  671/37  Ktfj)  =  684/37 

U(<?,)  -  759/37  U(tf2)  =  776/37  U(<?3)  =  763/37 

Figure  4 

The  computations  are  fundamentally  no  different  from  those  in  the  examples 
of  Chapter  1,  except  of  course  for  the  choice  of  the  sets  . 

By  following  the  branch  and  bound  algorithm,  the  following  optimal  policy 
is  generated. 
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stop,  k 


Expected  time  to  use  this  optimal  policy  is  692/37  .  k 
is  the  correct  act  for  *  the  random  variable  defined  by  the 
equation  Y  €  A^  a.s. 

Figure  5 


D.  Extensions  to  Problems  with  Modules 

Within  the  study  of  coherent  systems,  the  notion  of  a  module  has 
been  studied.  In  [Bfrnbaum  and  Esary,  1965],  a  module  is  defined  to 
be  a  subsystem  of  a  coherent  system.  Specifically,  a  nonempty  set 
ACC  is  a  module  of  (C  ,  <p)  if  we  can  write: 

i 


v  x  ,  <p(x)  -  t(r(x|A)  ,  x |q_a )  , 


where 


x|^  is  X  restricted  to  the  set  A  (hence  a  joint  performance 
of  the  components  A). 

(A  ,  T)  is  a  coherent  system,  sometimes  referred  to  as  the  module 
rather  than  A  . 

({cA}  U  (C-A)  ,  (r)  is  a  coherent  system,  with  the  pseudo-component 

c  replacing  the  set  A  .  The  specification  for  the  pseudo- 
A 

component  c.  is  given  by  T(XL)  . 
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For  any  coherent  system,  every  singleton  (one  element)  subset  is  a 
module.  For  the  (k/n)  systems  other  than  the  series  and  parallel,  the 
only  modules  are  the  singletons.  For  the  series  or  parallel  systems,  every 
nonempty  proper  subset  of  components  is  a  module. 

Our  interest  in  modules  will  be  to  formulate  a  very  general  fault 

finding  model.  To  extend  one  of  the  models  previously  mentioned,  suppose 

that  M.  ,...,  Mm  is  a  modular  partition  of  C  for  the  coherent  system 

(C  ,  cp)  .  By  this  we  mean  each  M.  is  a  module  of  (C  ,  ©)  and 

Mj  ,...,  is  a  partition  of  C  .  If  (M.  ,  I\)  is  the  coherent 

system  for  module  M.  in  the  definition  of  a  module,  then  our  problem 

might  be  to  determine  the  joint  state  of  these  modular  subsystems  in 

minimum  expected  time,  by  testing  individual  components.  Let  X  denote 

the  random  joint  performance  of  the  components  C  for  (C  ,  cp)  .  Then 

this  problem  is  a  decision  problem,  with  Y.  =  X(c.)  ,  i  *  1  ,...,  n  . 

There  are  2m  A . '  s  ,  one  for  each  joint  state  of  the  random  vector 
j  J 

r,[x|u  )  T  (XL  J.  This  model  reduces  to  a  previous  one  when 

]\  Ml  /  m\  Mm  / 


each  M.  is  a  singleton. 

A  general  fault  finding  model  which  relies  on  modules  is  one  in 
which  we  suppose  there  are  two  sets  of  modules,  the  observable  modules 

Qj  .  Qq  ,  and  the  repairable  modules  Rj  ,...,  Rr  .  The  idea  is  that 

tests  can  be  performed  on  any  of  the  observable  modules  to  determine 
whether  they  are  functioning  or  not;  we  are  limited  to  repairing  or 
replacing  only  repairable  modules.  In  order  to  make  this  problem  well- 
formulated,  assume  that  the  joint  state  of  the  repairable  modules  is  a.s. 
some  function  of  the  joint  state  of  the  observaL>e  modules.  We  can  say 
then,  knowing  the  joint  state  of  the  observable  modules  a.s.  implies  we 
know  the  joint  state  of  the  repairable  modules.  The  decision  problem  is 


KM 
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to  a.s.  determine,  in  minimum  expected  time,  the  joint  state  of  the 
repairable  modules  by  testing  the  observable  modules.  Assume  module  Q. 
requires  time  tj  a  0  to  test,  I  *  1  q  .  For  this  model,  take 

Yj  *  *Q|^XIq|^’  *  *  ^  »•••*  3  where  ]  is  a  typical 

observable  module.  There  are  2r  Aj's  ,  one  for  each  joint  state  of 
the  repairable  modules. 

E.  Computational  Experience 

The  branch  and  bound  algorithm  and  the  dynamic  programming  solution 
to  the  first  fault  finding  model  In  Section  B  above  were  programmed  on 
an  IBM  360  (model  67)  computer.  Briefly,  the  problem  Is  to  determine 
the  state  of  every  component  of  a  coherent  system,  In  minimum  expected 
time.  Again  we  must  remark  that  the  problem  is  specified  by  the  joint 
distribution  of  component  performance;  the  coherent  system  only  provides 
a  framework  in  which  to  pose  the  problem. 

The  cases  listed  below  are  representative  of  those  worked  out. 

The  distribution  of  component  performance  was  nominal  Independence  conditioned 
on  the  system  being  down  (type  ))  or  conditioned  on  the  system  having  just 
gone  down  (type  2).  Thus,  given  a  (k/n)  system  Is  down,  there  are  at 
most  k  -  1  working  components;  given  It  just  went  down,  there  are  exactly 
k  -  1  working  components.  Al)  joint  distributions  were  generated  by  a 
(k/n)  system,  and  a  type  (I  or  2)  of  conditional  distribution.  They  are 
specified  by  k/n  ,  type  In  Figure  6.  These  distributions  are  carefully 
detailed  in  the  Introduction  to  Chapter  3.  Figure  6  gives  a  comparison 
of  the  two  methods  employed.  The  column  "positive  states"  gives  the 
number  of  Joint  performances  x  with  positive  probability,  i.e.  for 
which  P{X  ■  x}  >  0  .  "Cycles"  refer  to  the  number  of  Iterations  the 
branch  and  bound  method  made.  Some  problems  proved  to  be  too  large  for 
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the  branch  and  bound  method  to  work  in  a  reasonable  length  of  time. 


Computation  time,  in  seconds 


joint 

pos itive 

dynamic 

branch 

;tr  ibut ion 

states 

programming 

and  bound 

cycli 

2/3,1 

4 

.10 

.04 

3 

3/3,1 

7 

.12 

.08 

5 

2/4,1 

5 

.69 

.20 

7 

2/4,2 

4 

.69 

.34 

11 

3/4,1 

11 

.91 

4.27 

91 

3/4,2 

6 

.85 

2.30 

49 

4/4,1 

15 

1.04 

54.65 

319 

4/4,2 

4 

.74 

.39 

11 

2/5,1 

6 

4.17 

1.46 

21 

2/5.2 

5 

4.17 

2.65 

36 

4/5,1 

26 

7.16 

not  run 

4/5,2 

10 

5.97 

>  600 

2/6,1 

7 

26.67 

10.79 

60 

2/6,2 

6 

26.92 

23.29 

118 

6/6,2 

6 

27.50 

24.68 

111 

Comparison 

of  Dynamic 

Programming  versus 

Branch  and 

Bound 

Figure  6 


The  dynamic  programming  computation  time  grows  roughly  as  6°  , 
where  n  is  the  number  of  components.  There  is  some  variation  in 
computation  time  for  various  joint  distributions  with  the  same 
number  of  components,  however  the  number  of  positive  states  seems  to 
give  a  rough  indication  of  this  variation,  fewer  positive  states 
leading  to  shorter  computation  times. 

Branch  and  bound  computation  times  are  seen  to  vary  substantially, 
even  for  a  fixed  number  of  components.  While  in  some  cases  this  method 
was  faster  than  dynamic  programming  by  a  factor  of  about  3,  in  other 
cases  it  was  slower  by  several  orders  of  magnitude.  If  a  specific 
problem  was  being  solved  many  times  over  for  different  input  data,  some 
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experience  with  both  solution  methods  would  be  useful  in  determining 
which  one  handles  that  problem  best. 

One  final  observation  to  be  made  is  that,  for  this  specific 
problem,  it  seems  that  the  largest  problem  either  solution  method  could 
handle  in,  say  less  than  30  minutes,  has  eight  or  nine  components. 

This  may  be  a  severe  restriction  on  solving  "real"  problems. 
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CHAPTER  3 

ANALYTICAL  RESULTS  FOR  (k/n)  SYSTEMS 

Introduct ion 

In  this  chapter  we  define  several  fault  finding  models  and  obtain 
some  analytical  results  concerning  them.  The  framework  is  that  of  coherent 
systems,  introduced  in  Chapter  2.  Chapter  3  is  developed  independently  of 
Chapter  1. 

A .  Background 

The  basic  problem  comes  about  by  considering  a  coherent  system  whose 
components  are  subject  to  being  tested.  The  tests  each  require  a  known 
non-negative  time  to  complete,  and  are  performed  sequentially  until 
enough  information  has  been  learned.  What  constitutes  enough  information 
depends  on  the  goal  of  the  testing  process. 

Each  model  is  specified  by  a  coherent  system  (C,  <p)  ,  a  joint 
distribution  for  the  set  of  performance  indicators  (X(c)  |  c£  C}  ,  a 
goal  for  the  testing  process,  either  determining  the  state  of  every 
component,  or  determining  the  state  of  the  system,  and  a  criterion  for 
comparing  pol icies. 

It  Is  convenient  to  index  the  components  In  C  ,  say  Cj  ,  ...  ,  cr  , 

letting  Y(  ■  X(Cj)  and  Y  -  (Yj  Yn).  Our  choice  of  a  coherent 

system  will  be  limited  to  a  k-out-of-n  system,  hereafter  written  (k/n)  . 

Recall  that  a  (k/n)  system  Is  one  of  n  components  which  functions  If  and 

only  if  k  or  more  of  its  components  function.  In  the  above  notation 

®(X)  »  1  <■>  ^  X(c)  i  k  <•>  Yj  i  k  .  In  particular,  (1/n) 

systems  are  known  as  parallel  systems  and  (n/n)  as  series  systems. 

While  independence  is  not  always  an  appropriate  distribution  for  the 

random  variables  (y,  .  Y  }  ,  it  does  play  a  fundamental  role  in  our 

i  n 
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models.  When  we  are  concerned  with  determining  the  system's  state, 
independence  is  assumed.  When  we  seek  the  state  of  all  the  components, 

one  of  two  distributions  is  assumed.  The  first  is  obtained  by  assuming 
that  the  components  nominally  operate  independently,  except  that  we  know 
the  system  is  down  (not  functioning).  Hence  the  actual  distribution  is 
independence  conditioned  on  the  system  being  down.  The  second  is  again 
nominally  independence.  We  suppose  all  components  begin  service  in  an 
operating  state  and  function  independently  for  some  random  lifetime,  each 
with  its  own  distribution.  This  process  continues  until  the  system  fails, 
at  which  time  every  component  is  frozen  at  its  current  state.  We  suppose 
further  that  no  two  components  have  positive  probability  of  simultaneous 
failure.  We  say  the  system  has  just  gone  down,  that  is,  has  at  least  one 
failed  conponent  which,  when  restored,  will  bring  the  system  up  again. 

For  example,  a  series  system  which  is  down  has  one  or  more  failed  components 
while  a  series  system  that  has  just  gone  down  has  exactly  one  failed 
component. 

The  two  criteria  considered  are  mentioned  in  Chapter  1,  the  expected 
time  to  complete  the  testing  process  and  the  probability  that  the  testing 
is  completed  on  or  before  some  deadline. 

A  policy  is  a  rule  which  indicates  which  component  should  be  tested 
next,  based  on  the  results  of  tests  already  made,  provided  any  further 
testing  is  necessary.  In  Chapter  I  policies  were  functions,  giving  a 
component  to  test  or  an  indication  that  testing  could  cease,  for  every 
possible  state  of  our  knowledge  about  the  problem.  While  this  formal 
representation  was  useful  in  Chapter  I,  a  more  informal  (but  equivalent) 
representation  is  used  here.  Namely,  a  policy  will  be  a  certain  kind  of 
directed  graph.  Each  node  is  either  a  stop  node,  meaning  the  testing 
process  can  cease  upon  reaching  this  point,  or  a  test  node,  meaning  more 


testing  is  necessary.  The  stop  nodes  are  labeled  S  and  have  no  arcs 
leaving  them.  The  test  nodes  are  labeled  i,  1  £  i  £  n  ,  indicating 
component  c.  is  to  be  tested  at  this  juncture,  and  have  exact  ly  two 
arcs  leaving  them.  One  arc  is  labeled  0  ,  the  other  arc  I  ,  corres¬ 
ponding  to  the  event  of  finding  component  c.  failed  or  working 
respectively.  Each  node  has  exactly  one  arc  entering  it,  except  a 
single  source  node  corresponding  to  the  beginning  of  the  testing  process. 
Figure  I  illustrates  typical  policies  for  a  (2/3)  system  with  several 
choices  of  a  goal  and  joint  distribution.  The  forward  orientation  of 
each  arc  is  downward. 


(a)  typical  policy  for  finding  the  state  of  all  components  of  a 
(2/3)  system  when  the  joint  distribution  of  component  perfor¬ 
mance  is  nominally  independence,  conditioned  on  the  system  being 
down. 

(b)  same  as  (a),  except  that  the  nominal  distribution  Is  conditioned 
on  the  system  having  Just  gone  down. 

(c)  typical  policy  for  finding  the  system's  state  for  a  (2/3)  system 
when  the  joint  distribution  of  component  performance  is  indepen¬ 
dence. 
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The  reader  should  be  able  to  verify  that  each  node  labeled  S 
really  is  a  juncture  at  which  testing  can  cease.  It  should  be  clear  how 
these  graphs  are  intended  to  be  used  as  policies.  The  component  to  be 
tested  initially  is  indicated  In  the  source  node,  for  example  c^  in 
policy  (a)  above.  Upon  testing  a  component  and  finding  it,  say  failed, 
we  proceed  to  the  node  connected  to  our  present  node  by  the  arc  labeled 
0  and  stop  or  make  a  test  as  indicated  by  the  node  at  our  present 
position.  In  order  to  avoid  useless  complications,  it  is  assumed  that 
a  policy  does  not  test  the  same  component  more  than  once.  Of  course, 
this  does  not  prevent  two  nodes  from  indicating  a  test  of  the  same 
component,  for  example  policy  (c)  of  Figure  I.  These  assumptions  are 
also  made  In  the  policy  definition  of  Chapter  1. 

Policies  (a)  and  (b)  of  Figure  1  illustrate  that  In  some  Instances, 
a  policy  can  be  specified  by  an  ordering  of  the  components.  We  will  refer 
to  policies  which  are  specified  by  orderings  of  the  components  as  sequen- 
tlal  policies.  More  precisely,  a  sequential  pol Icy  is  a  policy  tt  for 
which  there  Is  an  ordering  aj  a^  with  the  property  that  tt  begins 

by  testing  the  component  labeled  aj  ,  continuing  sequentially  to  6^  ,..., 
etc.,  until  enough  Information  has  been  learned  so  testing  can  stop,  or 
until  all  components  have  been  tested,  at  which  time  testing  can  certainly 
cease.  The  order  In  which  components  are  tested  does  not  depend  on  test 
outcomes. 

For  some  fault  finding  models,  every  policy  Is  a  sequantlal  policy, 

(a)  and  (b)  of  Figure  1  for  example,  Including  of  course  any  optimal  policy. 
On  the  other  hand,  not  all  policies  are  sequential,  as  shown  by  policy  (c) 
of  Figure  1.  An  Interesting  feature  of  each  fault  finding  problem  solved 
in  Chapter  3  Is  that  each  one  has  an  optimal  policy  which  Is  sequential. 


41 


B.  Finding  the  State  of  All  Components  in  Minimum  Expected  Time 


The  first  specific  models  treated  in  this  chapter  have  as  their 
criterion  the  expected  time  to  complete  the  testing  process.  This  is  the 
criterion  for  which  the  branch  and  bound  algorithm  of  Chapter  1  is  devel¬ 
oped  and  is  simply  the  expected  value  of  the  total  testing  time  incurred. 
The  total  testing  time  is  the  sum  of  the  times  for  each  individual  test 
made. 

Consider  first  the  problem  of  finding  the  state  of  every  component 
of  a  (k/n)  system,  when  the  distribution  of  joint  component  performance 
is  independence  conditioned  on  the  system  being  down.  Recall  that 


(1)  Y.  *  X(c.) 
Let 


0  if  component  c.  is  failed 
1  if  component  c.  is  functioning, 


n  . 


(2)  t.  =  time  required  to  test  component  c.  , 

(3)  p.  =  probability  (under  nominal  independence)  that  component  c.  is 

1  functioning,  1 

(3a)  q.  -  1  -  Pj  ,  and  let 

(4)  Pf*}  be  nominal  independence  of  the  random  variables  Yj  ,...,  Yn  . 

Then  we  have  P {Y .  !  I]  *p,  ,  i  «  I  .  n  .  If  (C  ,  cp)  is  the  (k/n) 

system,  the  distribution  with  respect  to  which  we  seek  to  minimize  the 

expected  testing  time  is  Pf*  |  cp(X)  =  0}  ,  equivalently  P[-  |  ^Y.  <  k] 

?  __  1  * 


The  only  assumption  made  about  the  parameters  is 

(5)  tj  *  0  and 

(6)  0  <  p  j  <  1  .  i  ■  1  .  n. 

We  will  show  below  that  there  is  an  optimal  policy  for  this  problem 
which  is  sequential.  Furthermore,  the  ordering  to  obtain  this  policy  is 
straightforward;  simply  index  the  components  so  that  t  .q  ./p . ^  t.  +  j q .  +  j/p .+ 
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i  =  l  n-1  ,  and  test  in  the  order  c.  .....  c  until  k-1  workinq 

1  n 

components  are  encountered,  when  testing  can  cease  as  any  remaining 
components  are  almost  surely  failed,  or  until  all  components  have  been 
tested.  This  result  is  proven  by  induction  on  n  ,  the  number  of 
components.  The  substance  of  this  result  is  in  the  fact  that,  after 
testing  a  component,  if  it  is  failed  we  are  faced  with  the  situation  of 
a  (k/n-1)  system,  or  if  it  is  functioning,  the  situation  of  a  (k-l/n-1) 
system,  for  the  remaining  n-1  components.  To  say  this  more  precisely, 
let 


(7)  T  “  expected  test  time  required  of  policy  i:  ,  and  if  tt  begins 

n  by  testing  component  c.  , 

(8)  T  (Y  *  6)  *  expected  remaining  testing  time  required  of  policy  n 

after  the  initial  test  of  Y.  ,  given  Y.  =  6  ,  where 
6  ■  0  or  1 .  1  1 

Lemma  1 :  Let  tt  be  a  policy  for  finding  the  state  of  all  components 

in  a  (k/n)  system,  given  the  system  is  down.  Let  1  <  k  <  n  and 

suppose  policy  tt  begins  by  testing  c.  .  Then 

n 


(9)  Ttt  "  t  j  +  P{Y . 


o  I  £  Y,  <  k}  T  (Y  -  0)  + 
j-1  J  n  ' 


P{Y.  -  1  |  £  Y.  <  k}  T  (Y  -  1)  . 

'  j-1  J  11  ' 

For  6  ■  0  or  1,  the  conditional  distribution 

n 

PC-  I  £  Y.  <  k  ,  Y.  »  6} 
j-1  J 


p(-  |  £v.  <  k  -  6}  , 
j-1 J 

j*i 


n}  as 


namely  the  one  which  results  when  components  fcj  |  j  j*  i  ,  j  -  1  , . . . ,  n] 
form  a  (k  -  6/n-l)  system,  given  the  system  is  down.  In  particular, 
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T^Y.  B  6)  is  the  expected  time  to  test  this  system,  using  as  a 

policy  tt{6)  ,  where  n(5)  ■  the  subgraph  of  tt  beginning  with  the  node 

reached  after  c.  is  tested  and  Y,  ■  6  ,  and  all  nodes  reachable  from 
i  i  * 

it. 

Proof:  Equation  (9)  follows  by  conditioning  on  the  outcome  of  the  first 

test  tt  makes.  To  show  the  second  assertion,  let  A  be  any  event 

defined  on  the  random  variables  {Yj  |  j  i  i  ,  j  *  I  , . . . ,  n}  .  Then 

n  n  n 

P{A  |  £  V.  <  k  ,  Y.  -  6}  -  P£A  j  £  Y.  <  k  ,  Y,  *  6}  *  P{a|  £y.<  k-  6 
j-1  J  ‘  J-l  J  j«l  J 

J»*l  J^i 

by  independence  of  Yj  ,...,  Y n  under  P{*3  .  The  remaining  assertion 

says  that  T^  (Y.  *  6)  can  be  thought  of  as  the  expected  time  to  use 

tt( 6)  ,  a  policy,  to  test  {c j  |  j  +  1 ,  j  ■  I  .  n}  as  if  these 

componer.v.  forced  a  (k-6/n-l)  system,  given  it  is  down.  But  this 

« 

follows  because  the  conditional  distribution  of  performance  indicators 
for  the  coiqponents  [c^.  |  j  +  i  ,  j  ■  1  ,...,  n}  ,  given  [Yj  =  6}  , 

is  as  shown  above.// 

Proposition  1; 

Consider  the  problem  of  finding  the  state  of  all  components  of  a 
(k/n)  system  in  minimum  expected  time,  given  the  system  is  down.  If  the 
components  are  indexed  so  that  tj9j/Pj  s  . ....  n  , 

then  the  sequential  policy  which  tests  Cj  , . . . ,  cr  until  k-1  working 
components  are  found  or  all  components  have  been  tested,  is  optimal. 
Proof;  We  begin  with  showing  the  result  for  (1/n)  and  (n/n)  systems 
directly.  The  case  (k/n)  is  proven  by  induction  on  n  . 

For  (1/n)  ,  or  parallel  systems,  we  see  all  components  are  a.s. 
failed,  so  no  tests  are  necessary. 
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For  (n/n)  ,  or  series  systems,  there  is  only  one  situation  in  which 
any  policy  can  stop  short  of  testing  all  components,  that  is  when  the 
first  n  -  I  tests  each  reveal  a  working  component.  In  this  instance, 
a  policy  can  stop,  since  the  one  untested  component  is  almost  surely 
failed.  In  any  other  circumstance,  all  components  must  be  tested.  Hence, 
a  policy  tt  ,  when  c.  is  the  one  component  which  might  not  be  tested  by 
tt  ,  has 


n  n 

T  -  zJt.  -  t,  P{Y.  “  1  ,  J  i*  i  |  X]Y:<n3 
"  j  =  l  J  1  J  j  =  l  J 

n  n 

■  IC1;  -  t.  (  n  p.)qj  /  (i  -  n  p.)  . 
j-i  J  1  j^i  j  1  j-i  j 


Minimizing  this  quantity  as  i  varies  is  equivalent  to  maximizing  t.q./p.  . 

For  the  sequential  policy  we  propose  as  optimal,  i  ■  n  .  Because  of  the 

assumed  ordering,  it  is  clear  this  policy  is  optimal.  In  fact,  any  policy 

which  does  not  test  c  in  the  eventuality  that  c.  ,...,  c  .  are 

n  '  1  n-l 

working  is  optimal . 

For  (k/n)  systems  in  general,  we  proceed  by  induction  on  n  . 

Basis;  (n  °  2)  See  above  for  proof  of  result  when  k  =  1  or  2. 

Induction:  (n  >  2)  When  k  =  1  or  k  =  n  ,  appeal  to  the  direct  proof  above. 

Suppose  then  that  1  <  k  <  n  .  We  appeal  to  Lemma  1  above  and 
the  hypothesis  of  the  induction.  Among  those  policies  (sequen¬ 
tial  or  not)  which  begin  by  testing  c.  ,  none  is  better  than 
the  sequential  policy  tt.  given  by  i,  1,  2,...,  i  -  1  ,  i  +  1, 
...,  n  .  This  is  because  for  every  policy  tt  which  begins  by 
testing  c.  ,  equation  (9)  holds;  the  quantities  T^(Y.  =  0)  and 

T  (Y.  *  1)  are  both  minimized  by  the  sequential  policy  tt.  , 

TT  I  I 

due  to  the  interpretation  given  them  in  Lemma  1  and  the 
induction  hypothesis.  It  now  remains  to  show  that  it  is 
optimal  to  test  c.  first.  Comparing  the 


*♦5 


sequential  policy  tt.1  given  by  the  sequence 

If  if  2  i  i  ■ « i  i  *  1 1  i^l  i  •  ■  •  |  n  , 

and  the  sequential  policy  rr.  ,  we  see  for  k  >  2  ,  both 

policies  have  the  same  value  because  each  must  make  at 

least  two  tests,  that  is,  T  *  T  .  The  case  of  k  =  2 

rr.1  rr. 
i  i 

is  illustrated  by  Figure  2.  Equations  (10)  and  (11)  are 
similar  to  equat  ion  (9). 


Figure  2 


(10)  T^,-  t,  +  t.  p(y j  =  0  |£y  <  2)  + 

i  j*l  J 

P{Yj  «  0  ,  Yj  *=  0|  jj^(.  <  2)  i  ( Y  j  -  0,  Yj  -  0) 


(11)  T  -  t.  +  t,  P{Y •  =  0  |  £Y.  <  2}  + 
"i  1  '  j-i  J 


n 

P{y,  -  0  ,  Y.  -  0  I  £Yj  <  2}  (Y.  =  o,  Y,  *  0) 


The  last  terms  in  equation  (10)  and  (11)  are  the  same,  since 
policies  tt.  and  tt.  agree  after  two  tests.  Hence 
is  better  than  tt.  if 


i 

TT. 

I 


n 

t  (1  -  P [Y .  =  0  J  £  Y  <  2}  )  - 
'  '  j-1  J 

t.  (I  -  P{Y.  -  0  |  fy.  <2})>0 
1  '  j  =  l  J 

equivalently  if  t ^ q j /p ^  £  t.q./p.  .  Since  this  is  clearly 
satisfied  for  each  i  ,  we  conclude  n!  is  as  good  as  tt.  , 
but  then  as  we  remarked  above,  the  sequential  policy  tTj 
given  by  1  n  is  the  best  policy  which  tests  c^ 

first.  Hence  T  ST  ,  £  "i  ,  showing  that  of  all 

H,  TT;  TT, 

policies  which  make  at  least  one  test,  tt^  is  as  good  as 
any.  Since  any  policy  must  make  at  least  one  test,  we 
conclude  the  sequential  policy  tt^  is  optimal.// 


This  pleasing  result  does  not  continue  to  apply  when  we  change 
the  assumptions  of  the  model  slightly.  Let's  consider  the  problem  which 
is  identical  to  the  one  above,  except  that  the  distribution  of  the  perfor¬ 
mance  indicators  is  nominally  independence  conditioned  on  the  system  having 

just  gone  down.  For  (k/n)  systems,  this  is  just  P{*  |  ^  Y.  =  k  -  1}  , 

j*l  J 


where  again  the  unconditional  P{*)  is  independence  on  the  random 

variables  [Y.  Y  }  .  Recall  we  wish  to  find  the  state  of  all 

I  n 

components  in  minimum  expected  time. 

We  can  obtain,  in  a  special  case,  a  similar  result  for  (k/n) 
systems.  The  technique  for  proving  these  results  is  the  same  as  for  the 
results  obtained  above.  For  this  reason,  we  choose  to  sketch  the  proofs, 
omitting  the  repetitious  details.  The  notation  and  assumptions  of 
equations  (1)  through  (8)  continue  to  apply. 

Consider  first  the  series  system  of  n  components,  the  (n/n) 
system.  Since  one  component  is  failed  and  the  remainder  working,  a.s., 


we  see  any  testing  policy  stops  as  soon  as  a  failed  component  is  found, 
or  n-1  components  are  found  to  be  functioning.  For  this  reason,  every 
policy  is  in  fact  a  sequential  policy,  as  illustrated  in  Figure  3. 


Typical  policy  for  series  system  of  n  components,  given 

system  just  went  down,  a^  . is  a  permutation  of 

1  i . .  •  ,  n  . 

Figure  3 

When  we  speak  of  a  policy  rr  given  by  the  sequence  aj  , . . . ,  , 

we  mean  as  shown  in  Figure  3.  Lemma  1  above  has  its  analogue  here. 

Lemma  2;  Consider  tt  ,  given  by  1  n  ,  as  a  policy  for  finding  the 

state  of  a  1 1  components  of  a  (n/n)  system,  given  the  system  has  just 

gone  down.  Supposing  n  5  2  ,  we  have 

(12)  T  “  tj  ♦PfY,  -  l  |£y  -  "  -  n  T  (Y,  -  I)  . 

J  *1  J 

n 

The  conditional  distribution  Pf*  I  T]  Y .  =  n  -  1  ,  Y ,  *  1 1  is  the  same 

•  i  J  1 

J  =  1 

n 

distribution  on  {Y,  .  Y  }  as  p{*  |£y.  =  n  -  2]  .  Hence 

2  n  j=2  -1 


- T.-J+J- 


*8 


(Yj  =  1)  is  the  expected  time  to  use  policy  n(Yj  =  i)  defined  by 

2  .  n  ,  to  test  the  components  fc2  , ....  c^}  arranged  in  series, 

when  this  system  has  just  gone  down. 

Proof:  The  method  is  the  same  as  used  in  Lemma  1.  Equation  (12)  results 

by  conditioning  on  the  results  of  the  first  test  and  realizing 

Ttt( Y |  =  0)  =  0  .  The  second  assertion  follows  as  easily.  Letting  A  be 

any  event  on  the  random  variables  fY»  , ...,  Y  }  ,  we  have 

*  n 

n  n 

P{A  |  £y.  =  n  -  1  ,  Y.  =  1}  =  P(A  |  X]  Y.  =  n  “  2)  by  independence  of 
j«l  J  1  j“2  J 

of  Yj  .....  Yn  ,  under  P { * }  .  This  model  has  a  simple  rule  for 

determining  the  optimal  policy  in  a  special  case. 


Proposition  2; 


Consider  the  problem  of  finding  the  state  of  all  components  of  a 
(n/n)  system  in  minimum  expected  time,  given  the  system  just  went  down. 
Suppose  n  2  2  and  also  suppose  there  is  an  indexing  of  the  components 
so  that 


(13)  t.p./q.  s  t.+|p|t|/q]+|  ,  i  -  1 

Then  the  policy  given  by  the  sequence 


» ...»  n  >  and  t^_j  ^ 

1  , . . . ,  n  is  opt ima 1 . 


Proof :  The  proof  is  by  induction  on  n  . 

Basis:  (n-2)  Since  any  policy  makes  exactly  one  test,  the  optimal  policy 
will  test  the  component  with  shortest  test  time,  namely  c^  . 


(Not i ce  wa 

are  only  us ing 

tj  ^  t£  .) 

Induct  ion;  (n  >  2) 

The  argument 

proceeds  as  in 

proposition  1 

.  Comparing 

pol icy 

1 

it.  given  by 

1 ,  i  •  2  , . . . , 

i  - 1 ,  i  + 1  , . . . , 

n  ,  and 

pol icy 

tt.  given  by 

ij  1 )  2  i  •  •  •  i 

i  -  1 ,  i+1  , . . . , 

,  n  , 

49 


we  see  T  ■  £  T  <=>  t,p,/q,  £  t.p./q.  .  The  policy  tt, 

tt-  n.  II  1  i  i  i  1 

i  i 

satisfies  T  ^  T  .  ,  using  lemma  2  and  the  hypothesis  of 
"1  ni 

of  the  induction.  Tt  follows  tt^  is  optimal,  because 

T  s  T  and  tt.  is  the  best  policy  which  begins  by 
"l  ni  ' 

testing  c.  .// 

The  (2/n)  system  has  a  similar  result  as  just  obtained  for  the 

series  case.  For  the  (2/n)  system,  the  random  variables  f y ^  .  y^j 

n 

are  nominally  independent,  conditioned  on  Y .  =  1  or  Y.  =  1  for  one  i  . 

i  =  l  '  1 

Every  policy  is  again  sequential,  stopping  as  soon  as  a  working  component 
is  found  or  n-1  components  have  been  tested.  Consider  the  random  vari¬ 
ables  rY,'  ,...,  Y  1  where  Y.'  =  1  -  Y.  .  These  random  variables  have 
the  joint  distribution  of  performance  indicators  for  a  series  system  which 
has  just  gone  down.  Furthermore,  since  knowing  Y  is  equivalent  to 
knowing  Y'  ,  every  policy  for  the  Y  process  is  equivalent  to  a  policy 
for  the  Y'  process  and  conversely.  Since  proposition  2  indicates  a 
special  case  for  which  the  optimal  policy  is  easily  obtained  for  the  Y' 
process,  we  are  lead  to  conclude 


Proposition  3: 


Consider  the  problem  of  determining  the  state  of  every  component  in  a 
(2/n)  system  which  has  just  gone  down.  If  the  components  are  indexed  so 
that 


<JU)  1,1,/p,  stwq.(1/p.t|  , 


I  =  1 


then  the  policy  which  tests  in  order  of  c.  c  until  a  working 

i  n 

component  is  found,  or  ^  has  been  tested,  is  optimal,// 

For  a  (k/n)  system  in  general,  not  all  policies  are  sequential, 
and  indeed  sometimes  no  sequential  policy  is  optimal.  Consider  the 


example  illustrated  by  Figure  4 


i 

t. 

pi 

t  .p./q. 
ri  ^i 

t.q./p. 
i^i  i 

1 

1 

.9 

9. 

.111 

2 

10 

.9 

90. 

1.11 

3 

10 

.95 

190. 

.526 

4 

100 

.95 

1900. 

5.26 

Oata 

table  for 

(3/4)  system 

Policy  tt 


Model  is  finding  state  of  all  components  in  minimum 
expected  time,  given  system  just  went  down.  No 
sequential  policy  is  optimal. 

Figure  4 

We  will  show  that  tt  is  better  than  any  sequential  policy,  and 
in  fact,  that  tt  is  optimal.  Firstly,  since  the  general  form  of  rr 
implies  T  s  21  ,  tt  is  better  than  any  policy  (sequential  or  not)  which 
begins  by  testing  c^  .  Lemma  1  has  its  analogue  for  this  model,  and  is 
proven  just  as  lemma  1  is.  Briefly,  it  says  that  if  the  first  test  finds 
a  failed  component,  then  the  remaining  components  look  like  a  (k/n-1) 
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system,  while  if  the  first  component  tested  is  working,  the  remaining 
components  look,  like  a  (k-l/n-1)  system.  Using  this,  along  with 
propositions  2  and  3,  we  see  that  among  all  policies,  sequential  or  not, 
which  begin  by  testing  c^  ,  none  is  better  than  the  sequential  policy 
given  by  2,  1,  3,  4,  because 

tlp]/«,l  *  t3p3/q3  s  V4/q4  '  tlql/pl  *  l3q3/p3  s  *4q4/p4 

and  tj  ^  t|^  .  The  same  reasoning  shows  that  the  sequential  policy  given 

by  3,  1,  2,  4,  is  best  among  all  policies  which  begin  by  testing  c^  . 

However,  the  form  of  any  sequential  policy  is  such  that  the  order  of  the 

first  two  tests  can  be  reversed  without  changing  the  value  of  the  policy. 

It  follows  that  these  two  sequential  policies  use  the  same  time  as  the 

sequential  policies  given  by  1,  2,  3,  4,  and  1,  3,  2,  4,  respectively. 

Now,  again  using  the  reasoning  above,  the  tt  illustrated  in  Figure  4 

is  as  good  as  any  which  begins  by  testing  Cj  .  Let  tt1  and  tt'  1  be 

the  sequential  policies  given  by  1,  2,  3,  4,  and  1,  3,  2,  4.  It  will 

follow  that  rr  is  ootimal  and  that  no  sequential  policy  is  optimal, 

if  T  <T  ,  and  T  <  T  .  The  following  calculations  are  straight- 
TT  TT1  TT  TT1  1  3  3 

forward. 

4 

11  +  10  (1  -  q,q2  p3p4  -  p,q2  p^)  /  Pf^Yj  =  2} 

Trf  "  n  +  1°  (1  “  V2  P3P4  ‘  p!p2  q3q4>  7  P[£y.  =  2} 

i  =  I  ' 

4 

V  =  n  +■  io  (i  -  q,P2  q3P4  -  p,q2  P3q4)  /  p{£y.  =  2} 

i  =  1 

Ttt  <  V  <«•  p3q3  >  p2/p2 
Tit  <  Ttt"  <=>  p3q3  >P2/C|2 
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There  is  a  special  case,  motivated  by  previous  results,  in  which  a 
sequential  policy  is  easily  obtained  and  is  optimal. 

Propos i t ion  4; 


Consider  the  problem  of  determining  the  state  of  every  component  in 
a  (k/n)  system  which  has  just  gone  down.  If  the  components  are  indexed 
so  that 


(15)  t.p./q.  s  t.t|p.t|/<,m  .  t.q./p,  *  t.  +  |,It|/p.+  |  ,  i  -  1  . n-  1  , 


t  ,  S  t  , 
n-1  n 

then  the  sequential  policy  given  by  the  ordering  1  ,...,  n  is  optimal. 
(That  is,  test  in  the  order  Cj  cn  until  k-1  working  or  n-k+1 
failed  components  are  found,  at  which  time  testing  can  stop.) 


Proof;  The  method  follows  lemma  1  and  proposition  l.  Analogous  to 
lemma  1,  we  see  upon  making  the  first  test,  the  remaining  n-1  components 
appear  to  form  a  (k/n-1)  or  (k-l/n-1)  system,  according  as  the  first 
test  finds  a  failed  or  working  component,  when  2  <  k  <  n  .  The  argu¬ 
ment  proceeds  exactly  as  proposition  1,  by  induction  on  n  .  The 
boundary  case  of  (1/n)  is  simple  as  no  tests  are  necessary;  all 
components  are  a.s.  failed.  The  result  for  (2/n)  and  (n/n)  systems 
is  contained  in  propositions  3  and  2  respectively.  To  complete  the 

induction  step,  we  only  need  to  show  T  ,  £  T  where  rr. '  is  the 

ni  ni  1 

sequential  policy  given  by  the  ordering  l,  i,  2  ,...,  i-1,  i+1  ,...,  n  , 
and  rr.  is  given  by  i,  1,  2  ,...,  i-1,  i+1  ,...,  n  .  But  when 
2  <  k  <  n  ,  at  least  two  tests  are  required  of  any  policy,  so 
T  ,  =  T  .// 

TT.  TT. 

I  I 

C .  Models  Seeking  the  System's  State  in  Minimum  Expected  Time 

For  the  problem  of  determining  the  state  of  a  (k/n)  system,  it  is 
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assumed  that  the  components  operate  independently.  Clearly  it  would  not 
be  reasonable  to  condition  on  the  system  being  down  or  having  just  gone 
down,  since  then  no  tests  would  ever  be  made.  Notice  also  that  indepen¬ 
dence  is  equally  inappropriate  when  seeking  the  state  of  every  component, 
because  such  a  distribution  would  cause  every  policy  to  test  every 
component.  In  this  section,  an  optimal  policy  minimizes  the  expected 
testing  time  used.  The  notation  and  assumptions  of  equations  (1)  through 
(8)  continue  to  apply. 

The  form  of  the  results  and  the  method  of  proof  are  analogous  to  the 

results  obtained  in  the  previous  section.  For  the  (1/n)  ,  or  parallel 

system,  every  policy  is  sequential  and  an  optimal  order  in  which  to  test 

is  c  j  ,...,  cn  whenever  t:  ./q  {  £  t.+1/q.+)  ,  i  =  1  .  n  -  1  .  For  the 

(n/n)  ,  or  series  system,  again  every  policy  is  sequential  and  an  optimal 

order  in  which  to  test  is  c.  ,...,  c  whenever  t./p.  £  t.  ,/p.  ,  , 

1  n  i  ri  i+l  ri+l 

i  *  1  ,...,  n  -  1  .  Similar  to  the  last  model  of  the  previous  section,  for 
the  (k/n)  systems  in  general,  sometimes  no  sequential  policy  is  optimal. 

If  the  indexing  can  be  performed  so  that  both  t./q.  £  t.  ,/q.  .  and 

t./p.  £  tj+]/pj+j  ,  i  =  1  ,...,  n  -  1  ,  then  the  sequential  policy  which 
tests  in  the  order  c^  ,...,  cn  stopping  as  soon  as  k  working  or  n-k+1 
failed  components  are  found,  is  optimal. 

D.  Models  with  a  Deadline  Criterion 

This  section  deals  with  a  criterion  mentioned  only  briefly  in  Chapter  1, 
the  deadline  criterion.  To  work  with  this  criterion,  let 
(16)  =  random  time  to  complete  testing,  using  policy  n  . 

Relating  this  notation  to  previous  notation,  we  have 

07)  Tn  ■  E<Tn>  ■ 

The  deadline  criterion  has  a  non-negative  parameter  t  ,  the  deadline;  an 
optimal  policy  seeks  to  maximize  the  probability  that  t  s  t  . 


54 


The  primary  tool  of  the  following  results  is  contained  in  [Kadane, 
1968]  as  his  theorem  I,  a  variant  of  the  Neyman-Pearson  lemma.  This 
result  is 

Theorem  I: 

Let  {p j }  and  {t j }  be  arbitrary  non-negative  sequences  such  that 

p.  <  ®  .  Let  X  be  the  class  of  sequences  fx.)  such  that 
i  '  1 

0  s  x.  i  I  for  all  i  and  let  b  -  t.  .  If  0  <  t  <  b  ,  then 

i=>P|>0  ' 

the  maximum  of 

(18)  5Z  X,P  5 

I  1  1 

subject  to 

(19)  Ex.t,  *  t 

and  fx,}  €  X  is  attained,  and  it  occurs  when  and  only  when 


(20) 


xi 


1  if  p.  >  r  t. 

0  if  p,  <  r  t( 


for  soma  r  ,  0  <  r  <  •  ,  and 

(21)  E*,t.  "  1 

The  set  of  r's  satisfying  (20)  is  the  same  for  each  optimal  fx.} 
and  is  a  single  point  or  a  closed  interval.// 

Kadane  uses  this  result  in  his  treatment  of  the  bal  I- in-the-box 
problem  with  a  budget  ceiling.  Briefly,  this  problem  consists  of 
searching  a  set  of  boxes  for  a  hidden  ball,  while  admitting  the  possi¬ 
bility  of  a  search  overlooking  the  ball.  An  optimal  policy  maximes  the 
probability  of  finding  the  bail  without  exceeding  a  budget  ceiling. 
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The  first  model,  tested  In  detail,  will  serve  to  Illustrate  how 
theorem  1  Is  used  to  obtain  analogous  results  for  the  remaining  models. 

The  problem  is  to  find  the  state  of  all  components  of  a  (2/n)  system 
for  which  the  distribution  of  joint  performance  is  nominal  independence 
conditioned  on  the  system  being  down.  An  optimal  policy  seeks  to  maxi¬ 
mize  the  probability  of  having  completed  the  testing  process  on  or  before 
the  deadline  t  .  Notice  every  policy  is  a  sequential  policy.  Components 
are  tested  in  some  prearranged  order  until  a  working  component  is  found, 
or  all  components  have  been  tested. 

The  assumptions  and  notational  conventions  of  equations  (1)  through 
(6)  will  continue  to  apply  here.  In  addition,  we  will  require  that  the 
deadline  t  be  small  enough  to  affect  us,  namely  that 

(22)  ft,  >t 

i-1  ' 

Suppose  policy  tt  is  given  by  the  sequence  1  ,...,  n  .  Then,  for  k 
defined  by 

(23)  0  S  t  -  £t.  <  *k+, 

we  have 


(24)  P{t  st  |  t  Y.  ■  I }  ■  P{Y.  ■  1  for  some  i  *  1  ,...,  k|  £  Y.  *  I) 
n  j-1  '  J  i-1  ' 


/  n 

n  \ 

(j5 

(.“j  q')) 

/(E  p, 
\j-i  J 

(n  q.) 

+  n  qj 

i-i  ' 

Wj  7 

We  see  the  problem  is  equivalent  to  finding  an  indexing  of  the  components 


l  J2. 

/  n 

n  \ 

( ^  pj 

V  j-i  J 

In  J 

'(j?.  p7i2j qi) 

+  ,5,  ) 

Wj  7/ 

k 

is  a  maximum,  subject  to  £  t s  t  .  While  theorem  1  does  not  address 

j-i  j 
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itself  to  this  problem  exactly,  the  situations  are  similar.  We  will  use 
theorem  1  in  two  ways  to  make  statements  about  our  problem.  We  remark 
first  that.  In  using  theorem  1,  the  flnlteness  of  our  model  is  no 

OD 

restriction.  Also,  there  is  always  an  optimal  solution  Cx j } j  =  ]  t0 

the  problem  posed  in  theorem  1  with  0  <  Xj  <  1  for  at  most  one  i  .. 

The  first  application  of  theorem  )  provides  a  policy  with  a  bound  on 
the  error  incurred  in  using  it  rather  than  an  optimal  policy.  Specifically, 


Proposition 


Consider  the  problem  of  finding  the  state  of  every  component  of  a 
(2/n)  system,  given  the  system  is  down,  prior  to  the  deadline  t  . 
Index  the  components  so  that 

(25)  tiqi/pi  *  ti+|qi+l/pi+l  1  “  1  .  n  "  1 

and  let  k  be  defined  as  in  equation  (23)  above.  Then,  for  the  policy 
given  by  the  sequence  1  ,...,  n  ,  equation  (24)  holds,  and  further,  if 
rr1  is  any  optimal  policy,  we  have 

(26)  0  *  P{t  s  t  |  £y.  =5  1)  -  P{x  St  |  £Y.  *  1} 

^  i-1  1  "  i-1  1 


*  f  Pi 


(  11 

\i*k+l 


where 

(27)  f 


k  n  n 

(t  -  2Z  t , )  /  t,  .  and  c  -  £  P:  n  +  n  q? 
i-1  1  K+l  j-1  J  ij<j  1  i-1  ' 


k 

In  particular,  if  f  -  0  ,  equivalently  if  J]  t.  ■  t  ,  then  tt  is 

i-1  ' 


optimal. 
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Proof:  Apply  theorem  I,  replacing  p,  by  p,/  n  qtl  /  c  , 

1  Ij^l  J/ 

I  ■  1  , . . . ,  n  .  Then  for 

II  i  <  k  +  I 

f  i  »  k  +  1 

0  i  >  k  +  I 

n 

=  j  Is  an  optimal  solution.  A  value  of  r  generating  this 

solution  is  ("t^  /  ^  |  J1  qi)  /  C)]  *  Ec*uation  (26)  now 

follows  by  realizing  that  an  optimal  policy  to  the  fault  finding  problem 
also  provides  a  solution,  although  not  necessarily  optimal,  to  the 
problem  posed  by  theorem  1.  The  value  of  the  optimal  solution  is 

n 

P{t  5  t  |  £y.  £  1}  +  right  hand  side  of  (26).// 

n  1-1  1 

We  remark  that  the  fault  finding  problem  for  this  model  is  well  known 
as  the  knapsack  problem  and  as  such,  there  are  techniques  available  for 
solving  it.  Proposition  5  merely  provides  quick  access  to  a  sub-optimal 
policy  and  gives  a  bound  on  how  much  better  any  optimal  policy  might  be. 

The  form  of  this  bound  suggests  how  the  model  might  be  expanded  to  allow 
a  partial  test  while  making  suitable  assumptions  about  the  usefulness  of 
such  a  test,  and  how  theorem  1  would  be  used  to  obtain  an  optimal  solution 
in  this  case. 

In  order  to  obtain  such  a  model  to  which  theorem  1  would  apply, 
assume  one  test  can  be  conducted  for  any  length  of  time  up  to  the  nominal 
time  necessary  for  a  complete  test.  Also  assume,,  for  0  s  f  £  l,  and 

i  a  1  m  ••  i  ^  i 

(29)  Pfa  partial  test  of  c.  ,  lasting  ft.,  determines  the  state  of  c.)*f, 
and  the  indicator  of  the  event  mentioned  in  equation  (29)  is  independent 
of  the  joint  state  of  the  components,  (Yj  ,...,  Y  )  .  Now  a  policy  must 
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also  indicate  which  one  if  any,  of  its  tests  is  to  be  a  partial  test, 
and  ho/  long  this  test  is  to  last.  Because  any  tests  concluded  after 
the  deadline  t  are  irrelevant,  as  is  the  order  in  which  tests  concluded 
prior  to  t  are  performed,  there  is  no  loss  of  generality  in  requiring 
the  partial  test  to  begin  just  after  the  last  complete  test  prior  to  t  , 

and  to  end  on  or  before  t  .  Let  tt  be  the  policy  given  by  . a^  , 

with  d  the  duration  of  the  partial  test.  Suppose  there  is  only  time 
for  at  most  k  complete  tests  prior  to  t  ,  equivalently  that 


0  S  t  -  jS  <  t 


i-1  ak+l 


Then,  if  d  >  0  ,  the  partial  test  is  on  c  and  5^  t  +  d  S  t  . 

ak+1  i-1  ai 


Further,  for  f  -  d/t  we  have  when  k  <  n  -  1  , 

*k+l 


(30)  P{t  5  t  |  EY*  3  l]  *  P{a  working  component  is  detected  prior  to 

i-1  ' 

n  k  /  \ 

t  I  ^  1}  -  Ep3  [  n  qa  )  /  c  +  f  P  n  q;  /  c  , 

i-1  '  i-1  ai  aj/  ak+l  j>k+l  J 

and  when  k  -  n  -  1  , 

n 

(31)  P[t  S  t  |  £y.  §  1}  -  P{a  working  component  is  detected  or  all 

i-1  ' 

n 

components  are  found  failed  prior  to  t  |  E  Y-  -  U 

i-1  ' 

-  £  pa  /  n  q i  \  /  c  +  f  (P  n  q,  +  5  q,  )  /  c  , 

i-1  ai  aj  f  \an  j*i  J  j-1  J  / 

where  the  normalizing  constant  c  -  sj'p.  n  q.  +  n  q.  as  above.  The 

ft 1  1  j*i  -I  j«l  J 


reason  for  the  extra  term  in  equation  (31)  is  because  when  k  =  n  -  1  ,  a 
successful  partial  test  will  tell  if  not  already  known,  the  state  of  all 
components.  This  case  is  easily  handled  directly,  since  (31)  shows  the 
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probability  that  a  policy  is  successful  depends  only  on  the  last  index 

a  ;  n-l  comparisons  determine  the  opt imal  choice.  When  k  <  n  -  I  , 
n 

It  is  clear  theorem  1  applies  directly,  giving 
Proposition  6; 

For  the  problem  of  finding  the  state  of  every  component  of  a  (2/n) 

system,  given  the  system  is  down,  prior  to  the  deadline  t  ,  when  a 

partial  test  is  allowed,  index  the  components  as  in  (25),  a  '  let  k 

be  defined  by  (23).  If  k  <  n  -  1  ,  then  for  tt  given  by  1  ,...,  n  , 

with  a  partial  test  of  c^+j  lasting  t  -  j^tj  i  tt  is  optimal.// 

1-1 

Let's  consider  the  (2/n)  system  when  the  distribution  of  component 
performance  is  independence  conditioned  on  the  system  having  just  gone 
down.  Again  every  policy  is  sequential.  Further,  every  policy  makes  at 
most  n  -  1  tests  before  stopping,  so  to  have  the  deadline  effective, 
we  will  assume 

n 

(32)  ^  t ■  _  t.  >  t  ,  j  *  1  ,...,  n  , 
i-1  '  J 

in  lieu  of  (22).  Propositions  5  and  6  have  their  analogue  here.  The 
proofs  are  omitted. 

Proposition  7: 

Consider  the  problem  of  finding  the  state  of  every  component  of  a 
(2/n)  system,  given  the  system  just  went  down,  prior  to  the  deadline  t  . 
Index  the  components  so  that  (25)  holds,  and  let  k  be  defined  by  (23). 
Then  for  tt  given  by  the  sequence  1  ,...,  n  , 

(33)  p{ts  t  |  £y.  -i}-£p./nq|\  /  Ep:  n  q.  , 

"  1-1  '  1-1  '  \j*i  J /  i-1  '  j*i  J 

and  if  tt1  is  any  optimal  policy,  we  have 
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(34)  0  £  P (r  ,  3  t  |  £>.  -  1}  -  P(t  S  t  |  E  Y.  -  1} 

"  i-1  '  n  i-1  1 


)n 

/£  p,  n  q, 

i-i  '  }t\  j 

k 

L  V  f  \  /  t  •  In  Particular,  if  E  t 

V  ’  i-i  1  j  k+l 


where  f 


then  tt  is  optimal.// 
Similarly  we  have 


josition  8: 


For  the  problem  of  finding  the  state  of  every  component  of  a  (2/n) 
system,  given  the  system  is  down,  prior  to  the  deadline  t  ,  when  a  partial 
test  is  allowed,  index  the  components  as  in  (25),  and  let  k  be  defined 


by  (23).  If  k  <  n  -  2  ,  then  for 
partial  test  of  ck+j  lasting  t  - 


tt  given  by  1 
k 

£t,  *  "  is 

1-1 


.  n  ,  with  a 

optimal .// 


We  will  treat  the  series,  that  is  (n/n)  system  only  briefly.  Notice 
that  if  Yj  ,...,  Yn  have  the  joint  distribution  for  a  series  system, 
given  the  system  just  went  down,  then  for  Y.  -  1  -  Y.  ,  i  -  1  ,...,  n  , 
the  Y'  variables  have  the  joint  distribution  of  a  (2/n)  system,  given 
the  system  just  went  down.  This  is  the  same  transformation  used  in 
Section  B  above  for  the  same  purpose.  It  allows  us  to  conclude  that  for 
a  (n/n)  system,  given  the  system  just  went  down,  Propositions  7  and  8 
hold,  with  the  interpretation  that 
(35)  P{Y.  -  1 }  -  q j  and  Pj  -  1  -  q,  . 

The  problem  of  finding  the  system's  state  for  series  or  parallel 
systems,  when  the  components  operate  independently,  is  treated  just  as 
the  above  models.  One  application  of  theorem  1  yields  a  sub-optimal 
policy  and  a  bound  on  how  much  better  an  optimal  policy  might  be,  while 


the  second  application  involves  the  concept  of  a  partial  test,  and  an 
optimal  policy  in  this  case. 

Consider  first  the  series  system  of  n  components.  To  find  the 
system's  state,  one  must  test  until  a  working  component  is  located,  or 
all  components  have  been  tested,  so  every  policy  is  sequential.  To 
make  the  deadline  effective,  it  is  assumed  (22)  holds,  that  is,  there  is 
not  enough  time  to  test  all  the  components  prior  to  t  .  Let  policy  tt 
be  given  by  a^  , a  ,  and  let  k  denote  the  maximum  number  of 
tests  tr  can  complete  prior  to  t  .  Then 

k  k 

(36)  P{t  £  t]  ■  1  -  n  p  and  £  t  s  t  . 

n  j-1  aj  i=l  ai 

The  problem  of  finding  a  n  which  maximizes  (36)  is  seen  to  be  equivalent 
to  seeking  an  indexing  of  the  components  which  maximizes 


&-1’ 


subject  to  t.  S  t  .  Theorem  1,  used  here  just  as  in  proposition  5, 


yields 


Proposition  9' 


Consider  the  problem  of  finding  the  system's  state  for  a  series  system 
of  n  independent  components,  prior  to  a  deadline  t  .  Index  the  com¬ 
ponents  so  that 

(37)  t j/-Ln  (p.)  5  t |+]/“Ln(p j+j)  ,  i  -  1  .....  n  -  1  . 

Let  the  policy  tt  test  in  the  order  of  1  ,...,  n  ,  and  let  tt'  be  any 


optimal  policy.  Then 
(38)  0  §  P(t  ,  S  tJ  -  p{t 


-*l},(j5,pj)('  -(pk+',f)  ■ 
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where 

k 

(39)  Olt-g  t,  <  tk  , 

i-1  1  K  1 


and 


In  particular,  if  f  ■  0  ,  then  tt  is  optimal. 


The  second  use  of  theorem  1  involves  the  concept  of  a  partial  test, 
however  here  the  assumptions  concerning  such  a  test  differ  from  those  made 
above.  The  assumption  of  (29)  is  replaced  by,  for  0  S  f  S  1  and 
i  =  l  , . . . ,  n  , 

(41)  P{a  partial  test  of  Cj  ,  lasting  t.  ,  determines  the 
state  of  c  j }  -  (P{)f"’1  . 


and,  as  before,  the  indicator  of  the  event  mentioned  in  (41)  is  independent 
of  the  joint  state  of  the  components  (Yj  Y  ) .  It  follows  that,  if 

k  <  n  -  1  ,  P{test ing  Cj  ck  ,  then  a  partial  test  of  ck+J  , 

lasting  f t k+  ^  ,  determines  the  system's  state]  =  1 


Using  the  same  technique  as  above,  we  have 


Proposition  10: 

For  the  problem  of  finding  the  system's  state  of  a  n  component 

series  system,  with  independent  components,  prior  to  the  deadline  t  , 

when  a  partial  test  is  allowed,  index  the  components  as  in  (37),  and  let 

k  be  defined  by  (39).  If  k  <  n  -  1  ,  then  for  tt  given  by  I  .  n  , 

k 

with  a  partial  test  of  ck+]  lasting  t  -  J^t.  ,  n  is  optimal. 

The  case  of  a  parallel  system  is  handled  by  the  transformation 

Y.  ■  1  -  Y.  as  was  done  above.  The  conclusion  is  that  propositions  9  and 
ii 

10  hold  for  parallel  systems,  provided  p.  is  interpreted  as  p{y.  =  0}  . 
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SUMMARY 

Chapter  1  deals  with  a  class  of  combinatorial  problems  called 
"decision  problems."  We  are  given  n  binary  random  variables  and  a 
function  of  the  vector  of  random  variables.  The  object  is  to  determine 
the  value  of  this  function  by  successively  testing  the  random  variables 
to  leern  their  value,  and  then  to  stop  when  enough  information  is  known 
to  infer  almost  surely  the  function's  value.  Each  random  variable  — 
requires  some  known  time  test;  an  optimal  testing  policy  determines 
the  function's  value  in  minimum  expected  time. 

Two  solutions  are  proposed,  a  branch  and  bound  solution  and  a 
dynamic  programming  solution.  The  functional  equation  arising  from 
dynamic  programming  requires  3°  evaluations.  However,  the  usual  bonus 
(in  dynamic  programming)  or  having  solved  the  original  problem  for  every 
possible  initial  state  occurs  here,  and  so  we  have  really  solved  a  bigger 
problem. 

The  branch  and  bound  solution  first  partitions  the  set  of  all 
policies  into  classes.  Then  it  computes  a  lower  bound  on  the  value  of 
a  policy  over  policies  in  each  specific  class.  The  class  with  the  smallest 
lower  bound  is  itself  partitioned.  The  algorithm  terminates  when  a  class 
containing  essentially  one  policy  is  chosen  for  partitioning,  since  for 
such  classes,  the  lower  bound  is  in  fact  equal  to  the  value  of  that 
policy.  Since  the  value  of  the  policy  chosen  this  way  is  a  lower  bound 
on  the  value  of  every  other  policy,  it  is  optimal.  For  the  solution 
presented,  the  lower  bounds  are  computed  by  adding  to  bounds  previously 
computed,  reducing  some  of  the  computational  complexity.  Also,  formulas 
are  given  for  computing  an  upper  bound  on  the  value  of  an  optimal  policy. 
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Using  this,  some  classes  of  policies  can  be  excluded  directly,  whenever 
their  lower  bound  exceeds  the  upper  bound,  reducing  some  of  the  storage 
used  by  the  algorithm. 

Another  criterion,  the  probability  that  a  policy  completes  its 
testing  prior  to  some  deadline,  Is  also  discussed  briefly. 

Chapter  2  introduces  coherent  systems  and  gives  some  fault  finding 
problems  for  them  which  can  be  handled  by  the  methods  of  Chapter  1,  for 
example,  finding  the  state  (functioning  or  failing)  of  every  component, 
or  finding  the  state  of  the  system.  Both  algorithms  of  Chapter  1  were 
programmed  for  a  computer  to  solve  the  problem  of  finding  the  state  of 
each  component  of  the  system.  A  comparison  of  the  computation  time 
used  by  both  methods  on  some  problems  is  included.  It  can  be  seen  that 
neither  method  is  uniformly  better  than  the  other  one  for  every  problem. 
While  branch  and  bound  seems  to  be  largely  unpredictable,  average  compu¬ 
tation  time  for  dynamic  programming  grew  roughly  as  6°  .  Since  the 
functional  equation  was  evaluated  3°  times,  it  appears  that  the  time 
to  evaluate  this  equation  was  proportional  to  2n  . 

The  final  chapter  contains  a  few  of  the  models  from  Chapter  2 
which  could  be  treated  analytically.  The  coherent  system  in  every  case 
is  a  (k/n)  system,  that  being  one  which  functions  if  and  only  if  k  or 
more  of  its  n  components  function.  The  problems  of  determining  the 
system's  state  when  the  components  operate  independently,  and  of  deter¬ 
mining  the  state  of  all  components  when  we  condition  on  the  system  being 
down  or  having  just  gone  down,  are  discussed.  When  an  optimal  policy  can 
be  identified,  it  turns  out  to  be  a  sequential  policy,  i.e.,  one  for  which 
the  components  are  tested  in  a  predetermined  order  until  testing  can 


cea  >e. 
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